J 




Europaisches Patentamt 

European Patent Office ^ _ _ 

- ^ ^ * 0 Publication number: 0 346 710 B1 

Office europeen des brevets 



® EUROPEAN PATENT SPECIFICATION 

@ Date of publication of patent specification: 10.11.93 @ lnt.ClAC12N 15/12, 01 2N 5/10, 

C12P 21/02 

@ Application number: 89110096.8 
(§) Date of filing: 03.06.89 



@ cDNAs coding for members of the carcinoembryonic antigen family. 



CO 
CO 



® Priority: 16.06.88 US 207678 




Proprietor: MILES INC. 


21.11.88 US 274107 




One Mellon Center 






500 Grant Str. 


@ Date of publication of application: 




Pittsburgh, PA 15219-2502(US) 


20.12.89 Bulletin 89/51 








© 


Inventor: Barnett, Thomas R., Dr. 


© Publication of the grant of the patent: 




27 Jeffrey Road 


10.11.93 Bulletin 93/45 




East Haven, CT 0651 3(US) 






Inventor: Elting, James J., Dr. 


@ Designated Contracting States: 




5 Heatherwood Drive 


AT BE CH DE ES FR GB GR IT U NL SE 




Madison, CT 06443(US) 






Inventor: Kamarck, Michael E. 


@ References cited: 




86 Russell Road 


EP-A- 263 933 




Bethany, CT 06525(US) 


EP-A- 0 212 880 




Inventor: Kretschmer, Axel, Dr. 






RIchard-Zorner-Strasse 32 


BIOCHEM. BIOPHYS. RES. COMMUN., vol. 




D-5060 Bergisch Gladbach 1(DE) 


142, no. 2, 30th January 1987, pages 511-518; 






R. OIKAWA et al.: "Primary structure of hu- 






man carcinoembryonic antigen (CEA) de- 


0 Representative: Danner, Klaus, Dr. et al 


duced from cDNA sequence" 




Bayer AG 






Konzernverwaltung RP 


IVIOL CELL BIOL., vol. 7, 1987, page 




Patente Konzern 


3221-3230; R. BEAUCHEMIN et al.: "Isolation 




D-51368 Leverkusen (DE) 


and characterization of full-lenght functional 






cDNA clones for human carcinoembryonic 






antigen" 







Note: Within nine months from the publication of the mention of the grant of the European patent, any person 

may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition 
shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee 
has been paid (Art. 99(1) European patent convention). 

Rank Xerox (UK) Business Sendees 



4i 



EP 0 346 710 B1 



PROC. NATL ACAD. SCI. USA, vol. 85, Sep- 
tember 1988, pages 6959-6963; Y. HINODA et 
al.: "Molecular cloning of a cDNA coding 
biliary glycoprotein I: primary structure of a 
glycoprotein immunologically crossreactlve 
with carclnoembryonic antigen" 

GENE, vol. 71, no. 2, November 1988, pages 
439-449; B.C. ROONEY et al.: "Molecular 
cloning of a cDNA for human pregnancy- 
specific B1 -glycoprotein: homology with hu- 
man carclnoembryonic antigen and related 
proteins" 



EP 0 346 710 B1 



Description 

BACKGROUND OF THE INVENTION 
6 Field of the Invention 

The present invention concerns nucleic acid sequences which code for carcinoembryonic antigen 
(CEA) antigen family peptide sequences. 

70 Background Information 

Carcinoembryonic antigen was first described by Gold and Freedman. J. Exp. Med., 121, 439-462, 
(1965). CEA Is characterized as a glycoprotein of approximately 200,000 molecular weight with 50-60% by 
weight of carbohydrate. CEA is present during normal human fetal development, but only in very low 
75 concentration in the normal adult intestinal tract. It is produced and secreted by a number of different 
tumors. 

CEA is a clinically useful tumor marker for the management of colorectal cancer patients. CEA can be 
measured using sensitive immunoassay methods. When presurgical serum levels of CEA are elevated, a 
postsurgical drop in serum CEA to the normal range typically Indicates successful resection of the tumor. 

20 Postsurgical CEA levels that do not return to normal often indicate incomplete resection of the tumor or the 
presence of additional tumor sites in the patient. After returning to normal levels, subsequent rapid rises in 
serum CEA levels usually Indicate the presence of metastages. Slower postsurgical rises from the normal 
level are most often interpreted to Indicate the presence of new primary tumors not previously detected. 
Post surgical management of colon cancer patients is thus facilitated by the measurement of CEA. 

25 CEA is a member of an antigen family. Because of this, the immunoassay of CEA by presently 
available methods is complicated by the fact that CEA is but one of several potentially reactive antigens. 
There have been at least sixteen CEA-like antigens described in the literature. Since some of these appear 
to be the same antigen described by different investigators, the actual number of different antigens is 
somewhat less than this number. Nonetheless, there is a complex array of cross-reactive antigens which 

30 can potentially Interfere with an Immunoassay of the CEA released by tumors. It is known that serum levels 
of CEA-like antigens are elevated in many non-cancerous conditions such an inflammatory liver diseases 
and also in smokers. It is important that immunoassays used for the monitoring of cancer patient status not 
be Interfered with by these other CEA-like antigens. Conversely, it Is important to be able to distinguish the 
antigens by immunoassays because of the possibility that different tumor types may preferentially express 

35 different forms of CEA. If so, then the ability to reliably measure the different forms of CEA can provide the 
means to diagnose or more successfully treat different forms of cancer. 

The members of the "CEA family" share some antigenic determinants. These common epitopes are not 
useful in distinguishing the members of the antigen family and antibodies recognizing them are of little use 
for measuring tumor-specific CEA levels. 

40 U.S.P. 3,663,684, entitled "Carcinoembryonic Antigen and Diagnostic Method Using Radioactive Io- 
dine", concerns purification and radioiodlnatlon of CEA for use In a RIA. 

U.S.P. 3,697,638 describes that CEA Is a mixture of antigens (components A and B in this case). U.S.P. 
3,697,638 mentions methods for separating and radloiodinating each component and their use in specific 
RIA'S. 

45 U.S.P. 3,852,415, entitled "Compositions for Use in Radioimmunoassay, as Substitute for Blood Plasma 
Extract in Determination of Carcinoembryonic Antigen" relates to the use of a buffer containing EDTA and 
bovine serum albumin as a substitute for plasma as a diluent for CEA RIA's. 

U.S.P. 3,867,363, entitled "Carcinoembryonic Antigens", is directed to the isolation of CEA components 
A and B, their labelling and use in a RIA. 
50 U.S.P. 3,927,193, entitled "Localization of Tumors by Radlolabelled Antibodies", concerns the use of 
radiolabelled anti-CEA antibodies in whole body tumor imaging. 

U.S.P. 3,956,258, entitled "Carcinoembryonic Antigens", relates to the isolation of CEA components A 
and B. 

U.S.P. 4,086,217, entitled "Carcinoembryonic Antigens", is directed to the isolation of CEA components 
55 A and B. 

U.S.P. 4,140,753, entitled "Diagnostic Method and Reagent", concerns the purification of a CEA isomer 
called CEA-S1 and its use in a RIA. 

U.S.P. 4,145,336, entitled "Carcinoembryonic Antigen Isomer", relates to the antigen CEA-SI. 
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U.S.P. 4.180,499. entitled "Carcinoembryonlc Antigens", describes a process for producing CEA 
component B. 

U.S.P. 4,228,236. entitled "Process of Producing Carcinoennbryonic Antigen", is directed to the use of 
the established cell lines LS-174T and LS-180 or clones or derivatives thereof for the production of CEA. 

5 U.S.P. 4,272,504. entitled "Antibody Adsorbed Support Method for Carcinoennbryonic Antigen Assay", 
concerns two concepts for the radioimmunoassay of CEA. First. U.S.P. 4,272,504 relates to a sample 
pretreatment in the form of heating to 65 to 85 • C at pH 5 to precipitate and eliminate extraneous protein. 
Second. It describes the use of a solid phase antibody (either on beads or tubes) as a means to capture 
analyte and radiolabelled CEA tracer. 

10 U.S.P. 4,299,815, entitled "Carcinoembryonic Antigen Determination", concerns diluting a CEA sample 
with water and pretreating by heating to a temperature below which precipitation of protein will occur. The 
pretreated sample is then immunoassayed using RIA, EIA, FIA or chemiluminescent immunoassay. 

U.S.P. 4,349,528, entitled "Monoclonal Hybridoma Antibody Specific for High Molecular Weight Car- 
cinoembryonic Antigen", is directed to a monoclonal antibody reacting with 180 kD CEA, but not with other 

75 molecular weight forms. 

U.S.P. 4,467,031, entitled "Enzyme-lmmunoassay for Carcinoembryonic Antigen", relates to a sandwich 
enzyme immunoassay for CEA in which the first of two anti-CEA monoclonal antibodies is attached to a 
solid phase and the second monoclonal is conjugated with peroxidase. 

U.S.P. 4,489.167, entitled "Methods and Compositions for Cancer Detection", describes that CEA 

20 shares an antigenic determinant with alpha-acid glycoprotein (AG), which is a normal component of human 
serum. The method described therein concerns a solid-phase sandwich enzyme immunoassay using as one 
antibody an antibody recognizing AG and another antibody recognizing CEA, but not AG. 

U.S.P. 4,578,349. entitled "Immunoassay for Carcinoembryonic Antigen (CEA)", Is directed to the use of 
high salt containing buffers as diluents in CEA immunoassays. 

25 EP 113072-A, entitled "Assaying Blood Sample for Carcinoembryonic Antigen - After Removal of 
Interfering Materials by Incubation with Silica Gel", relates to the removal from a serum of a plasma sample 
of interfering substances by pretreatment with silica gel. The precleared sample is then subjected to an 
Immunoassay. 

EP 102008-A, entitled "Cancer Diagnostics Carcinoembryonic Antigen - Produced from Perchloric Acid 
30 Extracts Without Electrophoresis", relates to a procedure for the preparation of CEA from perchloric acid 
extracts, without the use of an electrophoresis step. 

EP 92223-A, entitled "Determination of Carcinoembryonic Antiyen in Cytosol or Tissue - for Therapy 
Control and Early Recognition of Regression", concerns an immunoassay of CEA, not in serum or plasma, 
but In the cytosol fraction of the tumor tissue itself. 
35 EP 83103759.6, entitled "Cytosole-CEA-Measurement as Predictive Test in Carcinoma, Particularly 
Mammacarcinoma". is similar to EP 92223-A. 

EP 83303759, entitled "Monoclonal Antibodies Specific to Carcinoembryonic Antigen", relates to the 
production of "CEA specific" monoclonal antibodies and their use In Immunoassays. 

WO 84/02983. entitled "Specific CEA-Famlly Antigens, Antibodies Specific Thereto and Their Methods 
40 of Use", is directed to the use of monoclonal antibodies to CEA-meconium (MA)-, and NCA-specific 
epitopes In Immunoassays designed to selectively measure each of these individual components in a 
sample. 

All of the heretofore CEA assays utilize either monoclonal or polyclonal antibodies which are generated 
by Immunizing animals with the intact antigen of choice. None of them address the idea of making 
45 sequence specific antibodies for the detection of a unique primary sequence of the various antigens. They 
do not cover the use of any primary amino acid sequence for the production of antibodies to synthetic 
peptides or fragments of the natural product. They do not include the concept of using primary amino acid 
sequences to distinguish the CEA family members. None of them covers the use of DNA or RNA clones for 
isolating the genes with which to determine the primary sequence. 

60 



55 
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. DEFINITIONS 
Nucleic Acid Abbrevia tions 

adenine 
guanine 
cytosine 
thymidine 
uracil 

Amino Acid Abbreviations: 



A QD 




Asn 


asoaracri ne 


Thr 


tbiTfionine 

bAA^ AAA W 


Ser 


serine 


Glu 


Glutamic acid 


Gin 


^ X U C GiMIX AIG 


irro 


proline 


Gly 


glycine 


Ala 


alanine 


Cys 


cysteine 


Val 


valine 


Met 


methionine 


lie 


isoleucine 


Leu 


leucine 


Tyr 


tyrosine 


Phe 


phenylalanine 


Trp 


tryptophan 


Lys 


lysine 


His 


histidine 


Arg 


arginine 



45 

Nucleotide - A monomeric unit of DNA or RNA containing a sugar moiety (pentose), a phosphate, and a 
nitrogenous heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon (1 ' carbon 
of the pentose) and that combination of base and sugar is called a nucleoside. The base characterizes the 
nucleotide. The four DNA bases are adenine ("A"), guanine ("G"), cytosine ("C"), and thymine ("T"). The 
50 four RNA bases are A, G, C and uracil ("U"). 

DNA Sequence - A linear array of nucleotides connected one to the other by phosphodiester bonds 
between the 3' and 5' carbons of adjacent pentoses. 

Functional equivalents - It Is well known in the art that in a DNA sequence some nucleotides can be 
replaced without having an influence on the sequence of the expression product. With respect to the 
55 peptide this term means that one or more amino acids which have no function in a particular use can be 
deleted or replaced by another one. 

Codon - A DNA sequence of three nucleotides (a triplet) which encodes through mRNA an amino acid, 
a translation start signal or a translation termination signal. For example, the nucleotide triplets TTA, TTG. 



10 



A 

G 
C 
T 
U 
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CTT. CTC. CTA and CTG encode the amino acid leucine ("Leu"). TAG. TAA and TGA are translation stop 
signals and ATG is a translation start signal. 

Reading Frame - The grouping of codons during translation of mRNA into amino acid sequences. 
During translation, the proper reading frame must be maintained. For example, the sequence 
5 GCT6GTTGTAAG may be translated in three reading frames or phases, each of which affords a different 
amino acid sequence 

GCT GGT TGT AAG - Ala-GlyCys-Lys 

G CTG GTT GTA AG - Leu-Val-Val 

GC TGG TTG TAA G - Trp-Leu- (STOP) . 

Polypeptide - A linear array of amino acids connected one to the other by peptide bonds between the 
75 alpha-amino and carboxy groups of adjacent amino acids. 

Genome - The entire DNA of a cell or a virus. It includes inter alia the structural genes coding for the 
polypeptides of the cell or virus, as well as its operator. promoter~and ribosome binding and interaction 
sequences, including sequences such as the Shine-Dalgarno sequences. 

Structural Gene - A DNA sequence which encodes through Its template or messenger RNA ("mRNA") a 
20 sequence of amino acids characteristic of a specific polypeptide. 

Transcription - The process of producing mRNA from a structural gene. 
Translation - The process of producing a polypeptide from mRNA. 

Expression - The process undergone by a structural gene to produce a polypeptide. It is a combination 

of transcription and translation. 
25 Plasmid - A non-chromosomal double-stranded DNA sequence comprising an intact "replicon" such 

that the plasmid is replicated in a host cell. When the plasmid is placed within a unicellular organism, the 

characteristics of that organism may be changed or transformed as a result of the DNA of the plasmid. For 

example, a plasmid carrying the gene for tetracycline resistance (Tet*^) transforms a cell previously sensitive 

to tetracycline into one which is resistant to it. A cell transformed by a plasmid is called a "transformant". 
30 Phage or Bacteriophage - Bacterial virus, many of which consist of DNA sequences encapsulated in a 

protein envelope or coat ("capsid protein"). 

Cloning Vehicle - A plasmid, phage DNA or other DNA sequence which is capable of replicating in a 

host cell, which is characterized by one or a small number of endonuclease recognition sites at which such 

DNA sequences may be cut in a determinable fashion without attendant loss of an essential biological 
35 function of the DNA, e.g., replication, production of coat proteins or loss of promoter or binding sites, and 

which contains a marker suitable for use in the identification of transformed cells, e.g., tetracycline 

resistance or ampicillin resistance. A cloning vehicle is often called a vector. 

Cloning - The process of obtaining a population of organisms or DNA sequences derived from one such 

organism or sequence by asexual reproduction. 
40 Recombinant DNA Molecule or Hybrid DNA - A molecule consisting of segments of DNA from different 

genomes which have been joined end-to-end outside of living cells and have the capacity to infect some 

host cell and be maintained therein. 

cDNA Expression Vector - A procaroytic cloning vehicle which also contains sequences of nucleotides 

that facilitate expression of cDNA sequences in eucaroytic cells. These nucleotides include sequences that 
45 function as eucaryotic promoter, alternative splice sites and polyadenylation signals. 

Transformation/Transfection - DNA or RNA is introduced into cells in such a way as to allow gene 

expression. "Infected" referred to herein concerns the introduction of RNA or DNA by a viral vector into the 

host. 

"Injected" referrred to herein concerns the microinjection (use of a small syringe) of DNA into a cell. 
50 CEA antigen family (CEA gene family) - a set of genes (gene family) and their products (antigen 
family) that share nucleotide sequences homologous to partial cDNA LV-7 (CEA-(a)) and as a result of 
theses similarities also share a subset of their antigenic epitopes. Examples of the CEA antigen family 
include CEA ( = CEA-(b)), transmembrane CEA (TMCEA) = CEA-(c) and normal crossreacting antigen NCA 
( = CEA-(d)). 

55 
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SUMMARY OF THE INVENTION 

The present invention concerns the following DNA sequences designated as TM-2 (CEA-(e)), TM-3 
(CEA-(f)), TM-4 (CEA-(g)), KGCEA1 and KGCEA2, which code for CEA antigen family peptide sequences: 

5 

SEQUENCE AND TRANSLATION OF CDNA OF TM-2 

70 

10 30 50 

CAGCCGTGCTCGAACCCTTCCTGGACCCCAAGCTCTCCTCCACAGCTGAACACACCGCCA 

70 90 110 

GCAGGAGACACeATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
MetGlyHisLeuSerAlaPcoLeuHisArgValArgValProTrpGln 

20 

130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThcAlaSe cLeuLeuThrPheTr pAsn Prop roThr The AlaGlnLeu 

25 

190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThcGluSe cMe tProPhcAsnValAlaGluGlyLysGluValLeuLeuLeuValHi s 

30 

250 270 290 

AATCTCCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTyrSerTrpTyrLysGlyGluAcgValAspGlyAsn 

310 330 350 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
40 ArgGlnlleValGlyTyrAlalleGlyThrGlnGlnAlaThrPcoGlyProAlaAsnSe r 

370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
^ GlyArgGluThrlleTy rPfoAsnAlaSe cLeuLeuIleGlnAsnValThcGlnAsnAsp 

430 450 470 

go ACACCATTCTAfcACCCTACAAGTCATAAACTCAGATCTTGTGAATGAAGAAGCAACTGCA 
ThrClyPheTy cTh r LeuClnVal I leLy sSc cAspLeuValAsnGluGluAlaTh cGly 



55 
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490 510 530 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTycProGluLcuProLysProScrlleSerSerAsnAsnSerAsnPro 



550 570 590 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAldValAlaPhcThrCysGluPcoGluThrGlnAspThrThrTyr 



610 630 650 

• • • • ' • . 

CTGTGGTGGATAAA"CAATCACAGCCTCCCGGTCAGTCCCAGGCTCCAGCTGTCCAATGGC 
LeuTrpTrpI leAsnAsnGlnSe rLeuProValSe cProArgLcuGlnLeuSe cAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnAcgThr LeuTh cLeuLeuSe r Va 1 Th r AcgAsnAspTh rGly P r oTy rGluCy sGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTCAATGTCACCTATGGC 
I IcGlnAsnProValSe cAlaAsnAcgSecAspPcoValThrLeuAsnValThrTy rCly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCACGGCCAAACCTCAGC 
ProAspThrProThrlleSerProSerAspThrTyrTy cArgProGlyAlaAsnLeuSer 

850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGCAACA 
LeuSerCysTycAlaAlaSerAsnProProAlaClnTycSecTrpLeuIleAsnGlyThf 



910 930 950 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTCGATCC 
PhcGlnGlnSe rThrGlnGluLeuPhellePcoAsnlleThcValAsnAsnSe rGlySe r 



970 990 1010 

TATACCTGCtACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAACACGATC 
Ty cThrCysHi sAl a AsnAsnSe rValThcCl yCys AsnAcgThr ThrValLy sThr I le 
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1030 1050 1070 

ATAGTCACTGATAATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGC 
g I leValThcAspAsnAlaLeuProGlnGluAsnGlyLeuSerProGlyAlalleAlaGly 

1090 1110 1130 

ATTCTGATTGGAGTACTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
10 IleVdllleGlyValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeu 

1150 1170 1190 



75 



CATTTCGGG.'UVGACCGGCAGGCCAAGCGACCAGCGTGATCTCACAGAGCACAAACCCTCA 
Hi sPheGlyLysThcGlyArgAlaSe cAspGlnArgAspLeuThcGluHi sLysProSe r 

1210 1230 1250 

20 GTCTCCAACCACACTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAACTTACT 
ValSerAsnKisThrGlnAspHisSerAsnAspProProAsnLysMe tAsnCluValThr 

1270 1290 1310 

25 TATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCC 
Ty rS*^ rThrLeuAsnPheGluAlaGlnGlnProThrGlnProThcSe rAlaSerProSec 



30 



35 



40 



1330 1350 1370 

CTAACAGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGTCCTCC 
LeuThrAlaThrGluIlcIleTyrSerGluValLysLysGln 



1390 1410 1430 

TCACTGCAGTGCTGATGTATTTCAAGTCTCTCACCCTCATCACTAGGAGATTCCTTTCCC 

1450 1470 1490 

CTGTACGGTAGAGGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGGC 



1510 1530 1550 

• • • • • • 

ATCTCCAGGCTGCCTCGTCACTGCCCCTCTCTCAGTGTCAATAGATCAAACTACATTGGC 

1570 1590 1610 

AGTCTGTAGCAAACCCAACCTTCTTGTCATTCAAATTTGGCAAAGCTGACTTTCGGAAAG 

50 
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1630 1650 1670 

ACGGACCAGAACTTCCCCTCCCTTCCCCTTTTCCCAACCTGGACTTGTTTT/u^ACTTGCC 

1650 1710 1730 

TGTTCAGAGCACTCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTA/.TTCGCATTT 

^''SO 1770 1790 

CCCATACCCTTGAGGTTATCTCCTTTTCCATTAACTACATGTCCCAGCAAACACCGACAC 

1810 1831). 1850 

AGACAAAGTAAACGGCACTAATCCTTCTCCTATTTCTCCAAAGCCTTCTGTGAACTAGCA 

1870 1890 1910 

AAGAGAACAAAATCAAATATATAACCAATACTGAAATGCCACAGGTTTGTCCACTGTCAC 

1530 1950 1970 

GGTTCTCTACCTGTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTA 

1950 2010 2030 

ATCCTTCTGGCAAGCCTGTCTTCAGAGAACCCACTAGAAGCAACTAGGAAAAATCACTTG 

2050 2070 2090 

CCAAAATCCAACGCAATTCCTGATCG/'JVAATGCAAAAGCACATATATGTTTTAATATCTT 

2110 2130 2150 

TATGGGCTCTCTTCAAGGCAGTGCTGAGAGGGAGGGGTTATAGCTTCAGCAGGGAACCAG 

2170 2190 2210 

CTTCTCATAAACACAATCTCCTAGGAACTTGGGAAAGCAATCAGAGAGCTGCCCTTCACC 
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2230 2250 2270 

CATTATTTAAATTCTTAAAGAATACAC^ATTTGGCGTATTCGCATTTTTCTCCTTTTCTC 

2290 2310 2330 

TGAGACATTCCACCATTTTAATTTTTGTAACTGCTTATTTATGTGAAAu^GGGTTATTTTT 

2350 2370 2390 

ACTTAGCTTAGCTATGTCAGCCAATCCGATTGCCTTAGGTGAAAGAAACCACCGAAATCC 

»5 24 10 2430 2450 

CTCAGGTCCCTTGGTCAGGAGCCTCTCAAGATTTTTTTTGTCAGAGGCTCCAAATAGAAA 



70 



20 



25 



30 



40 



45 



2470 2490 2510 

ATAAGAAAACGTTTTCTTCATTCATGGCTAGAGCTACATTTAACTCAGTTTCTAGGCACC 

2530 2550 2570 

TCACACCAATCATCAACTACCATTCTATTCCATCTTTGCACCTGTGCATTTTCTCTTTGC 

2590 2610 2630 

CCCCATTCACTTTGTCAGGAAACCTTCGCCTCTGCTAAGGTGTATTTGGTCCTTGAGAAG 



2650 2670 2690 

35 TGGGAGCACCCTACAGGGACACTATCACTCATGCTGGTGGCATTCTTTACAGCTAGAAAC 

2710 2730 2750 

CTGCACTGGTGCTAATGCCCCTTGGGAAATGGGGCTGTGAGGAGGAGGATTATAACTTAG 



2770 2790 2010 

GCCTAGCCTCTTTTAACACCCTCTGAAATTTATCTTTTCTTCTATGGGGTCTATAAATCT 

2030 2850 2870 

ATCTTATAATAAJCAACCAACQACAGCACGAAGACACGCAAATGTACTTCTCACCCAGTCT 



50 
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2890 2910 2930 

. • • * ' 

TCTACACAGATCGAATCTCTTTCGGGCTAACAGAAAGGTTTTATTCTATATTGCTTACCT 

2950 2970 2990 

GATCTCATGTTAGGCCTAAGACCCTTTCTCCAGGAGGATTAGCTTCCACTTCTCTATACT 

3010 3030 3050 

CAGGTACCTCTTTCACGGTTTTCTAACCCTCACACGCACTGTCCATACTTTCCCTCATCC 

3070 3090 3110 

ATCCTGTCCTGTGTTATTTAATTTTTCCTGGCTAAGATCATCTCTGAATTATGTATGAAA 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCAGACATCGAAAAAAAAAA 
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SEQUENCE AND TRANSLATION OF cDNA OF TK-3 



10 30 50 

CAGCCGTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGCGCCA 

70 90 ■ 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
Me tGlyHi sLeuSe rAlaPcoLeoHisArgValArgValProTrpGln 

130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe rLeuLeuThrPheTrpAsnProProThrThrAlaGlnLeu 

190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThcGluSerMetProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTy rSe rTcpTy rLysGlyGluArgValAspGlyAsn 

310 330 - 350 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleValGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSer 

370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTy rProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 
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430 450 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 
ThrGlyPheTy cThrLeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 



490 510 530 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTyrPro'GluLeuProLysProSerlleSerSerAsnAsnSerAsnPro 



550 570 590 

• • • • , , 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysCluPcoGluThrGlnAspThrThrTyr 



610 630 650 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTcpIleAsnAsnGlnSe rLeuProValSe rProArgLeuGlnLeuSe rAsnGly 



670 690 710 

AJJ^CAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThrLeuThrLeuLeuSe cValThcAcgAsnAspThrGlyProTy cGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgScrAspProValThrLeuAsnValThrTycXJly 



790 810 830 

• • • • > . 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThrProThrlleSerProSerAspThrTycTyrArgProGlyAlaAsnLeuSer 
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850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSerCysTyrAlaAlaSe rAsnProProAlaGlnTyrSerTrpLeuIleAsnGlyThr 

910 930 950 

■ _ • • • • • 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySer 

970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty rThrCysHi sAlaAsnAsnSe rValThrGlyCysAsnArgThrThrValLysThrl le 

1030 1050 1070 

ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAATCAAAGCCAGCAACACCACA 
IleValThrGluLeuSe rProValValAlaLysProGlnlleLysAlaSerLysThrThr 

1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrGlyAspLysAspSe rValAsnLeuThrCysSe cThrAsnAspThrGlyl leSe r 

1150 1170 1190 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
lleAcgTrpPhePheLysAsnGlnSerLeuProSerSerGluArgNetLysLeuSerGln 

1210 1230 1250 



GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSerlleAsnProValLysArgGluAspAlaGlyThrTy rTrpCys 
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1270 1290 1310 

GAGGTCTTCAACCCAATCACTAAGAlACCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
CluValPheAsnProIleSe cLysAsnGlnSerAspProIleMetLeuAsnValAsnTyr 

1330 1350 1370 

AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
AsnAlaLeuProGlnGlUAsnGlyLeuSerProGlyAlalleAlaGlylleVallleGly 

1390 1410 1430 

• ••••• 

GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
ValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

ACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACTTATTC 
ThrGlySe rSe rClyProLeuGln 

1510 1530 1550 

• ••••• 
TACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCCCTAAC 

1570 1590 1610 

AGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGAAAAAAAAAAA 



1630 
AAAAAAAAAA 
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SEQUENCE AND TRANSLATION OF cDNA OF TM-4 

10 30 5b 

CACCCGTGCTCCAACCGTTCCTGCAGCCCAAGCTCTCCTCCACACCTCAAGACAGGGCCA 



70 90 



110 



GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
MetOlyHisLeuSerAlaPcoLeuHisAcgValArgValProTrpGln 

130 - 150 170 

CCGCTTCTCCTCACACCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCACCTC 
ClyLeuLeuLeuThcAlaSerLeuLeuThrPheTrpAsnProPcoThrThrAUctnLeu 



150' 210 



230 



ACTACTCAATCCATCCCATTCA/vTGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSecMetPcoPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 



250 270 



290 



AATCTCCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGG/^GAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTyrSerTrpTyrLysGlyGluArgValAspGlyAsn 



310 330 



350 



CGTCAAATTGTAGGATATGCAATACGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleValGlyTycAlalleClyThcGlnGlnAlaThrProGlyProAl^tCnSer 

370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACCTCACCCAGAATCAC 
GlyArgGluThrlleTyrProAsnAlaSerLeuLeuIleClnAsnValThrClnAsnAsp 

430 450 <70 

ACACCATTCTACACCCTACAAGTCATAAACTCAGATCTTGrGAATGAACAACCAACTCCA 

ThrClyPheTycThrLeuClnValUeLysSerAspLeuValAsnGluGluAlaThrGly 
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490 ^ 510 530 

CAGTTCCATGTATACCCGGACCTCCCCAJ^GCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTy cProGluLeuProLysProSerlleSerSerAsnAsnSecAsnPro 



550 570 590 

gtggaggacaacgatgctgtggccttcacctgtg;u\cctgagactcagcacacaacctac 
valGluAspLysAspAlaValAlaPheThrCysGluPioGluThrGlnAspThrThrTyr 

610 630 650 

CTGTGCTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGCCTGCAGCTGTCCAATGGC 
LeuTrpTrpIleAsTiAsnClnSe cLeuProValSe rProAcgLeuGlnLeuSe cAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAJVGGAATGACACACGACCCTATGAGTGTGAA 
AsnArgThrCeuThr LeuLcuSe cvalThrArgAsnAspThrGlyProTyrGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCCCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
I leGlnAsnProValSe cAlaAsnArgSerAspProValThcLeuAsnValThi Ty rGly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGCGGCAAACCTCAGC 
ProAspThcProThrlleSerProSerAspThrTyrTy rArgPcoGlyAlaAsnLeuSer 



850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGG.\ACA 
LeuSe cCysTyrAlaAlaSe cAsnProProAlaGlnTy cSerTrpLeuIleAsnGlyThc 



910 930 950 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSc cThcGlnCluLeuPhc llePcoAsnlleThcValAsnAsnSc cGlySe r 

970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty rThcCysHisAlaAsnAsnSe cValThrGlyCysAsnAcgTh rTh cVal Ly sTh r I Ic 
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1030 1050 1070 

ATACTCACTGATAATGCTCTACCAC^AGAAAATGGCCTCTCACCTGGGGCCATTGCTGGC 
5 IleValThi-AspAsnAlaLeuFcoGlnCluAsnClyLeuSerProGlyAlalleAlaCly 

1090 1110 1130 

ATTGTGATTCGAGTACTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
IleVallleClyValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeu 

1150. 1170 " 1190 

75 CATTTCCGGAACACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGA 
Hi sPheGlyLysThrGlySe rSe rGlyProLeuGln 

12-10 1230 12S0 

20 ...... 

AGTTACTTATTCTACCCTGAACTTTCAAGCCCAGCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

25 CCCATCCCTAACAGCCACAGAAATAA.TTTATTCAGAAGTAAAAAAGCAGTAATCAA.^CCT 

1330 

30 GAAAV^AAAAAAAAAAJ>lAA 



The present invention is also directed to a replicable recombinant cloning vehicle ("vector") having an 
insert comprising a nucleic acid, e.g., DNA, which comprises a base sequence which codes for a CEA 
35 peptide or a base sequence hybridlzable therewith. 

This invention also relates to a cell that is transformed/transfected, infected or injected with the above 
described replicable recombinant cloning vehicle or nucleic acid hybridlzable with the aforementioned 
cDNA. Thus the invention also concerns the transfection of cells using free nucleic acid, without the use of 
a cloning vehicle. 

40 Still further, the present invention concerns a polypeptide expressed by the above described transfec- 
ted, infected or injected cell, which polypeptide exhibits immunological cross-reactivity with a CEA. as well 
as labelled forms of the polypeptide. The invention also relates to polypeptides having an amino acid 
sequence, i.e.. synthetic peptides, or the expression product of a cell that is transfected, injected, infected 
with the above described replicable recombinant cloning vehicles, as well as labelled forms thereof. Stated 

45 otherwise, the present invention concerns a synthetic peptide having an amino acid sequence correspond- 
ing to the entire amino acid sequence or a portion thereof having no less than five amino acids of the 
aforesaid expression product. 

The invention further relates to an antibody preparation specific for the above described polypeptide. 
Another aspect of the invention concerns an immunoassay method for detecting CEA or a functional 

50 equivalent thereof in a test sample comprising 

(a) contacting the sample with the above described antibody preparation, and 

(b) determining binding thereof to CEA in the sample. 

The invention also is directed to a nucleic acid hybridization method for detecting a CEA or a related 
nucleic acid (DNA or RNA) sample in a test sample comprising 
55 (a) contacting the test sample with a nucleic acid probe comprising a nucleic acid, which comprises a 
base sequence which codes for a CEA peptide sequence or a base sequence that is hybridlzable 
therewith, and 

(b) determining the formation of the resultant hybridized probe. 
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The present Invention also concerns a method for detecting the presence of carcinoennbryonic antigen 
or a functional equivalent thereof in an animal or human patient in vivo comprising 

a) introducing into said patient a labeled (e.g., a radio-opaque material that can be detected by X-rays, 
radiolabeled or labeled with paramagnetic materials that can be detected by NMR) antibody preparation 
according to the present invention and 

b) detecting the presence of such antibody preparation in the patient by detecting the label. 

In another aspect, the present invention relates to the use of an antibody preparation according to the 
present invention for therapeutic purposes, namely, attaching to an antibody preparation radionuclides, 
toxins or other biological effectors to form a complex and Introducing an effective amount of such complex 
Into an animal or human patient, e.g., by injection or orally. The antibody complex would attach to CEA in a 
patient and the radionuclide, toxin or other biological effector would serve to destroy the CEA expressing 
cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic representation of the transmembrane CEA's 
DETAILED DESCRIPTION OF THE INVENTION 

In the parent application 87111/68, published as EP-A-263 933. applicants described the following 

CEA's: 





ATCC No. 


CEA-(a) partial CEA (pcLV7) 
CEA-(b) full coding CEA (pc 15LV7) 
CEA-(c) TM-1 (FL-CEA; pc 19-22) 
CEA-(d) NCA (pcBT 20) 


67709 
67710 
67711 



In the present application, applicants described the following CEA's: 





ATTC No. 


CEA-(e) TM-2 (pc E22) 
CEA-(f) TM-3 (pc HT-6) 
CEA-(g) TM-4. 


67712 
67708 



ATCC Nos. 67708. 67709, 67710. 67711 and 67712 were all deposited with the American Type Culture 
Collection on May 25. 1988. 



on 
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The sequences for CEA-(a), CEA-(b), CEA-(c) and CEA-(d) are given herelnbelow: 



CEA-(a) : 



10 



15 



20 



25 



30 



35 



CCaCTTTACACAACC^CACOrcATCAMOOCTlCATCi^AOCA^ 
CAC GAT CyvC CAT OCT CTA OX TTA AOC TCT GAA err CAG ATT CAC 
TCCTOCCTAAATAATCACAOCCrcCCCCTCAGTOOCAOCCTCCW 
AC3C ACC CIC ACT CTA CTC ACT CnC ACA AOC AAT O^T CTA (» or TAT GW^ 
CACAACGAAmACTCXTGftCCACAOCGACCCACTCAOCCAGCGATICCTCTATOOCCEA 
CAC GAC CO: AOC ATT TX go: TCA TAC ACC TAT TAC CCT CCA GOC CTC GAA CCT CAC CCT 
CIC TCC CAT a:A OOC TCT AAC OCA OCT OCA GAG TAT ICr TC» CTC ATT GAT GOG ACC CIC 
CACCAACACACACAACACCTCTrTATCTCCAACATCACTGACAACAACAOCCGACTCTAT 
ACC TGC CAC OCC AAT AAC TCA OCC ACTGOCACAOCAGGACrACACTCAACACAATCACAC 
TCTCrcCCGATCCOCAACCXTCCATCTCCAGCAACAACTOCAAACOCCTOGAGGACA^ 
CATCarTCTGOCCTTCACTCTGAACCTGACOCTCACAACACAACCTACCTCTOCTCCCTA 
AATOCTCACAGCCICCCACTCAGTaXAOCCTCOCCTCTCCAATOQCAACAGGACCCTC 
ACT CTA TIC AAT CIC ACA AGA AAT GAC OCA AGA OCC TAT CTA TCT CGA ATC CAG JWC TCA 

GTCACTOCAAACCOCACTGACOCACTCACCClCGArCTCCTCT^TCGCCCGGACACCCa: 
Arc ATT TOC CCC OOC CC 



40 



45 



(b) 



10 



20 



30 



40 



SO 



C ACC ATG GAG rCT CCC TCG GCC CCT CTC CAC AGa">GG TGC ATC CCC T6G CAG AGG CTC 
M.t Glu Ser Pro S«r AU Pro Leu Hit Arg Trp Cys He Pro Trp Cln Arg Leu 



50 



55 
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60 70 80 90 100 no 

CTG CTC ACA GCC TCA CTT CTA ACC TTC TGG AAC CCG CCC ACC ACT GCC AA6 CTC ACT 
Leo Leu Thr AU Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr Thr AU Lys Leu Thr 

1 2 3 



120 130 UO 

• • • 

ATT GAA TCC ACC CCG TTC AAT GTC GCA GAG 
He Glu Ser Thr Pro Phe Asn V*l AU Glu 
4 5 6 7 8 9 10 11 12 13 



ISO 160 170 

■ • • 

GGG AAG GAG 6TG CTT CTA CTT GTC CAC 
Gly Lys Glu Val Leu Leu Leu V4I His 
14 15 16 17 18 19 20 21 22 



180 190 : 

AAT CTG CCC CAG CAT CTT TTT GGC TAC 
Asn Leu Pro Gin His Leu Phe Gly Tyr 
23 24 25 26 27 28 29 30 31 



!00 210 220 

■ • • 

AGC TGG TAC AAA 6GT GAA AGA GT6 GAT GGC 
Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly 
32 33 34 35 36 37 38 39 40 41 



230 240 250 

• • • 

AAC CGT CAA ATT ATA GGA TAT 6TA ATA 
Asn Arg Gin He He Gly Tyr Val He 
42 43 44 45 46 47 48 49 SO 



260 270 280 

• • • 

GGA ACT CAA CAA GCT ACC CCA GGG CCC GCA 
Gly Thr Gin Gin AU Thr Pro Gly Pro AU 
51 52 53 54 55 56 57 58 59 60 



290 300 310 320 330 340 

TAC AGT 6GT CGA GAG ATA ATA TAC CCC AAT GCA TCC CTG CTG ATC CAG AAC ATC ATC 

. Tyr Ser Gly Arg Glu He He Tyr Pro Asn AU Ser Leu Leu He Gin Asn He He 

61 62. 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 



350 360 37( 

■ • • 

CAG AAT GAC ACA GGA TTC TAC ACC CTA 
Gin Asn Asp Thr Gly Phe Tyr Thr Leu 
80 81 82 83 84 85 86 87 88 



» 380 390 400 

■ • ■ 

CAC GTC ATA AAG TU GAT CTT 6TG AAT GAA 
His Val lit Lys Ser Asp Leu Val Asn Glu 
89 90 91 92 93 94 95 96 97 98 



410 420 430 440 450 

GAA GCA ACT GGC CAG TTC CGG GTA TAC CCG GAG CTG CCC AAG CCC TCC ATC TCC AGC 

Glu AU Thr Gly Gin Phe Arg Vtl Tyr Pro Glu Leu Pro Lys Pro Ser He Ser Ser 

99 101 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 



460 470 480 

" • • 

AAC AAC TCC AAA CCC GTG GAG GAC AAG 
Asn Asn Ser Lys Pro V*l Glu Asp Lys 
118 119 120 121 122 123 124 125 126 



490 500 510 

• ■ • 

GAT GCT GTG GCC TTC ACC TGT GAA CCT GAG 
Asp AU VaI AU Phe Thr Cys Glu Pro Glu 
127 128 129 130 131 132 133 134 135 136 



00 
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520 530 540 550 560 570 

« mm m • • 

ACT CAG GAC GCA XCC TAC CTG TGG TGG GTA AAC AAT CAG AGC CTC CCG GTC AGT CCC 
5 Thr Gin Asp AU Thr Tyr Leu Trp Trp VaI A$n Asn Gin Scr Leu Pro Vil Ser Pro 

137 138 139 140 141 142 143 144 145 146 147 148 149 ISO 151 152 153 154 155 



580 590 600 610 620 

• • • « • 

AGG CTG CAG CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT GTC ACA AGA AAT 
Arg Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn VaI Thr Arg Asn 
156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 



75 630 640 650 660 670 680 

mm mm* • 

GAA CAA GCA AGC TAC AAA TGT GAA ACC CAG AAC CCA GTG AGT GCC AGG CGC AGT GAT 

Glu Gin Al* Ser Tyr Lys Cys Glu Thr Gin Asn Pro Vil Ser AU Arg Arg Ser Asp 

175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 

20 

690 700 710 720 730 740 

a « , • • • 

TCA GTC ATC CTG AAT GTC CTC TAT GGC CCG GAT GCC CCC ACC ATT TCC CCT CTA AAC 
Ser V4l He Leu Am VaT Leu Tyr Gly Pro Asp Al* Pro Thr lie Ser Pro Leu Asn 
26 196 ig? 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 



750 ' 760 770 780 790 

■ • m 9 • 

ACA TCT TAC AGA Ta GGG GAA AAT CTG AAC CTC TCC TGC CAC GCA GCC TCT AAC CCA 

Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn Leu Ser Cys His AU AU Ser Asn Pro 

213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 



800 810 820 830 840 850 

35 

CCT GCA CAG TAC TCT TGG TTT GTC AAT GGG ACT TTC CAG CAA TCC ACC CAA GAG CTC 
Pro AU Gin Tyr Ser Trp Phe V«l Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu 
232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 

40 

860 870 880 890 900 910 

• • • • • • 

TTT ATC CCC AAC ATC ACT GTG AAT AAT AGT GGA TCC TAT ACG TGC CAA GCC CAT AAC 
Phe lie Pro Asn lie Thr V*l Asn Asn Ser Gly Ser Tyr Thr Cys Gin AU HU Asn 
251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 

45 

920 g30 940 950 960 970 

• • ■ • • • 

TCA GAC ACT GGC CTC AAT AGG ACC ACA GTC ACG ACG ATC ACA GTC TAT GCA GAG CCA 
50 Ser Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr He Thr V*l Tyr AU Glu Pro 

270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 



55 



EP 0 346 710 B1 



980 990 1000 1010 1020 

• • • mm 

CCC AAA CCC TTC ATC ACC AGC AAC AAC TCC AAC CCC GTG GAG GAT GAG GAT GCT GTA 
Pro Lys Pro Phe He Thr Scr Asn Asn Ser Asn Pro VaI G1u Asp Glu Asp AU Val 
289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 



1030 1040 1050 1060 1070 1080 

• m m m • • 

GCC TTA ACC TGT GAA CCT GAG ATT CAG AAC ACA ACC TAC CTG TGG TGG GTA AAT AAT 
Al4 Leu Thr Cys Glu Pro Glu lie Gin Asn Thr Thr Tyr Leu Trp Trp Val Asn Asn 
308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 



75 1090 1100 1110 1120 1130 1140 

m • ' • m m • 

CAG AGC CTC CCG GTC AGT CCC AGG CTG CAG CTG TCC AAT GAC AAC AGG ACC CTC ACT 
Gin Ser Leu Pro Val Ser Pro Arg Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr 
327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 

20 

1150 1160 1170 1180 1190 

■ ■ • • • 

CTA CTC AGT GTC ACA AGG AAT GAT GTA GGA CCC TAT GAG TGT GGA ATC CAG AAC GAA 
Leu Leu Ser Val Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly He Gin Asn Glu 
25 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 



1200 1210 1220 1230 1240 1250 

mm m m m m 

TTA AGT GTT GAC CAC AGC GAC CCA GTC ATC CTG AAT GTC CTC TAT GGC CCA GAC GAC 
Leu Ser Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp Asp 
365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 



1260 1270 1280 1290 1300 1310 

• • ■ « • • 

CCC ACC ATT TCC CCC TCA TAC ACC TAT TAC CGT CCA GGG GTG AAC CTC AGC CTC TCC 
Pro Thr He Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu Ser 
384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 



1320 1330 1340 1350 1360 

• ■ m m m 

TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG TAT TCT TGG CTG ATT GAT GGG AAC ATC 
Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser Trp Leu Ho Asp Gly Asn He 
403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 



1370 1380 1390 1400 1410 1420 

m m m m m m 

CAG CAA CAC ACA CAA GAG CTC TTT ATC TCC AAC ATC ACT GAG AAG AAC ACC GGA CTC 

Gin Gin His Thr Gin Glu Leu Phe He Ser Asn He Thr Glu Lys Asn Ser Gly Leu 

422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 



55 



OA 
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U30 1440 I4S0 

• ■ ■ 

TAT ACC TGC CAG GCC AAT AAC TCA GCC ACT 
Tyr Thr Cy$ Gin AU Asn Asn Ser Ala Ser 
441 442 443 444 445.446 447 448 449 4S0 



1460 1470 1480 

• • • 

GGC CAC AGC AGG ACT ACA GTC AAG ACA 

Gly His Ser Arg Thr Thr V4I Ly$ Thr 

451 452 453 454 4S5 456 457 458 459 



1490 1500 1510 1520 1530 1540 









• 


• • 


• ■ • 


10 


ATC 


ACA 


GTC TCT GCG 


GAC 6TG CCC AAG CCC TCC ATC 


TCC AGC AAC AAC TCC AAA CCC 




He 


Thr 


VaI Ser AU 


Asp V4I Pro Lys Pro Str Ilo 


Sor Scr Asn Asn Scr Lys Pro 




460 


461 


462 463 464 


465 466 467 468 469 470 471 


472 473 474 475 476 477 478 


IS 






1550 
• 


1560 1570 
• • 


1580 1590 
• • 




GT6 


GAG 


GAC AAG GAT 


GCT 6TG QCC TTC ACC T6T GAA 


CCT GAG GCT CAG AAC ACA ACC 




V»l 


Glu 


Asp Lys Asp 


AlA Val AU Pho Thr Cys Glu 


Pro Glu AU Gin Asn Thr Thr 




479 


480 


481 482 483 


484 485 486 487 488 489 490 


491 492 493 494 495 496 497 



1600 1610 1620 

■ • ■ 

TAC CTG TGG TGG GTA MT G6T CAG AGC CTC 
Tyr Leu Trp Trp V»1 Asn Gly Gin Str Leu 
498 499 500 501 502 503 504 505 506 507 



1660 1670 1660 

• . • « 

AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT 
Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
517 518 519 520 521 522 523 524 525 526 



1630 1640 1650 

• • ■ 

CCA GTC ACT CCC AGG CTG CAG CTG TCC 
Pro VaI Sor Pro Arg Leu Gin Leo Ser 
508 509 510 511 512 513 514 SIS 516 



1690 1700 1710 

■ mm 

GTC ACA AGA AAT GAC GCA AGA GCC TAT 
V»l Thr Arg Asn Asp AU Arg AU Tyr 
527 528 529 530 531 532 533 534 535 



1720 1730 

m m 

6TA TGT GGA ATC CAG AAC TCA CTG ACT 
Vil Cys Gly He Gin Asn Ser Val Ser 
536 537 538 539 540 541 542 543 544 



17S0 1760 





• 




■ 






GCA AAC CGC 


ACT GAC 


CCA GTC 


ACC 


CTG 


GAT 


AU Asn Arg 


Ser Asp 


Pro V4I 


Thr 


Leu 


Asp 


545 546 547 


548 549 


550 551 


552 


553 


554 



40 



1770 



1780 



1790 



1800 



1810 



1820 



GTC CTC TAT GGG CCG GAC Aa CCC ATC ATT TCC CCC CCA GAC TCG TCT TAC CTT TC6 
V4I Leu Tyr Gly Pro Asp Thr Pro He He Ser Pro Pro Asp Sor Sor Tyr Leu Ser 
555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 



45 



1830 



1840 



1850 



1860 



1870 



1880 



50 



GGA GCG AAC CTC AAC CTC TCC TGC CAC TCG CCC TCT AAC CCA TCC CCC CAG TAT TCT 
Gly AU Asn Leu Asn Leu Ser Cys His Ser AU Ser Asn Pro Ser Pro Gin Tyr Ser 
574 575 576 577 578 579 580 581 582 583 584 565 586 587 588 589 590 591 592 



55 
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1890 1900 1910 1920 1930 

• • • • 

TGG CGT ATC AAT GGG ATA CCC CA6 CAA CAC ACA CAA GTT CTC TTT ATC GCC AAA ATC 

Trp Arg He A$n Gly He Pro Gin Gin Hit Thr Gin Vil Leu Phe lie Ala Uys He 

593 594 595 596 59/ 598 S99 600 601 602 603 604 605 606 607 608 609 610 611 



1940 1950 1960 1970 1980 1990 

« « • • • ■ 

ACG CCA AAT AAT AAC GGG ACC TAT GCC TGT TTT GTC TCT AAC TTG GCT ACT GGC CGC 
Thr Pro Asn Asn Asn Gly Thr Tyr AU Cys Phc VaI Ser Asn Leu AU Thr Gly Arg 
612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 



2000 2010 2020 2030 2040 2050 

m m m m m m 

AAT AAT TCC ATA GTC AA6 AGC ATC ACA GTC TCT GCA TCT GGA ACT TCT CCT GGT CTC 
Asn Asn Ser He Val Lys Ser He Thr Val Str AU Ser Gly Thr Ser Pro Gly Leu 
631 632 633 634 635 636 637 636 639 640 641 642 643 644 645 646 647 648 649 



2060 2070 2080 2090 2100 2110 

■ • ■ m m m 

TCA GCT GGG GCC ACT GTC GGC ATC ATG ATT GGA GTG CTG GTT GGG GTT GCT CTG ATA 

Ser Ala Gly AU Thr Val Gly lit Het lie Gly Val Leu Val Gly Val AU Leu He 

650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 



2120 2130 2140 2150 2160 

■ ■ • mm 

TAG CAG CCC TGG TGT AGT TTC TTC ATT TCA GGA AGA CTG ACA GTT GTT TTG CTT CTT 

2170 2180 2190 2200 2210 2220 

« • • ■ • ■ 

CCT TAA AGC ATT TGC AAC AGC TAC AGT CTA AAA TTG CTT CTT TAC CAA GGA TAT TTA 

2230 2240 2250 2260 2270 2260 

• ■ ■ ■ • ■ 

CAG AAA ATA CTC TGA CCA GAG ATC GAG ACC ATC CTA GCC AAC ATC GTG AAA CCC CAT 

2290 2300 2310 2320 2330 

• • * • ■ 

CTC TAC TAA AAA TACJUA AAT GAG CTG GGC TTG GTG GCG CGC ACC TGT AGT CCC AGT 
2340 2330- 2360 2370 2380 2390 

• • m m m m 

TAC TCG GGA GGC TGA GGC AGG AGA ATC GCT TGA ACC CGG GAG GTG GAG ATT GCA GTG 
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70 



75 



20 



25 



30 



35 



2400 2410 2420 2430 2440 2450 

■ • • ■ • a 

AGC CCA GAT CGC ACC ACT QCA CTC CAG TCT GGC AAC AGA GCA AGA CTC CAT CTC AAA 

2460 2470 2480 2490 2500 

■ • • • • 

AAG AAA AGA AAA GAA GAC TCT GAC CTG TAC TCT TGA ATA CAA GTT TCT GAT ACC ACT 
2510 2520 2530 2540 2550 2560 

■ • m a m m ■ 

GCA CTG TCT GAG AAT TTC CAA AAC TTT AAT GAA CTA ACT GAC AGC, TTC ATG AAA CTG 

2570 2580 2590 2600 2610 2620 

• • ■ m m m 

TCC ACC AAG ATC AAG CAG AGA AAA TAA TTA ATT TCA TGG GGA CTA AAT GAA CTA ATG 
2630 2640 2650 2660 2670 2680 

• • • m w m 

AG6 ATA ATA TTT TCA TAA TTT TTT ATT TGA AAT TTT GCT GAT TCT TTA AAT GTC TTG 
2690 " 2700 2710 2720 2730 

m m m mm 

TTT CCC AGA TTT CAG GAA ACT TTT TTT CTT TTA AGC TAT CCA CTC TTA CAG CAA TTT 
2740 2750 2760 2770 2780 2790 

• ■ • • mm 

GAT AAA ATA TAC TTT TGT GAA CAA AAA TTC AGA CAT TTA CAT TTT ATC CCT ATG TGG 

2800 2810 2820 2830 

• • • ■ 

TC6 CTC CAG ACT TGG GAA ACT ATT CAT GAA TAT TTA TAT TGT ATG 



40 



45 



50 



55 
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CEA-(c) 



10 30 50 

• ••••• 

CAGCCGTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 
70 90 110 

• • • • • • 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
NetGlyHisLeuSecAlaProLeuHisArgValArgValProTrpGln 

130 150 170 

• ■ • • • • 
GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSerLeuLeuThrPheTrpAsnProProThrThrAlaGlnLeu 

190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerMetProPheAsnValAlaGluGlyLysGluValLeuLeuLeuValHis 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTyrSecTrpTyrLysGlyGluArgValAspGlyAsn 

310 330 350 

CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnlleV^lGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSer 

370 390 410 

• • • • • • 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTyrProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 

430 450 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 

ThrGlyPheTy cThrLeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 
490 510 530 



oo 
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CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisVslTyrProGluLeuProLysProSerlleSerSerAsnAsnSecAsnPro 

550 570 590 

CTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCACGACACAACCTAC 
ValGluAspLysAspAlaValAlaPJieThrCysGluProGluThrGlnAspThrThrTyr 

610 630 650 

• • * - . , 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpIleAsnAsnGlnSerLeuProValSerProAcgLeuGlnLeuSerAsnGly 

670 690 710 

• •••«, 
AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThrLeuThrLeuLeuSe rValThcArgAsnAspThrGlyProTy rGluCysGlu 

730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnValThrTyrGly 

790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 

ProAspThcProThrlleSerProSerAspThrTyrTyrArgProGlyAlaAsnLeuSer 

850 870 890 

* • • . . 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuSerCysTyrAlaAlaSerAsnProProAlaGlnTyrSerTrpLeuIleAsnGlyThc 

910 930 950 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSecThrGlnGluLeuPhelleProAsnllcThrValAsnAsnSerGlySer 

970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
TyrThcCysHisAldAsnAsnSecValThcGlyCysAsnArgThrThcValLysThcIle 

1030 1050 1070 
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ATAGTCACTCAGCTAAGTCCACTAGTAGCAAAlJCCCCAAATCAAAGCCACCAAGACCACA 
IleValThrCluLeuSerProValvalAlaLysProGlnlleLysAlaSerLysThrThr 

1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrClyAspLysAspSerValAsnLeuThrCysSerThrAsnAspThrGlylleSe r 

1150 1170 1190 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGA.»kGCTGTCCCAG 
IleArgTrpPhePheLysAsriGlnSerLeuProSerSerGluArgMe tLysLeuSerGln 

1210 1230 1250 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGCTCT 
GlyAsnThrThrLeuSerlleAsnProValLysAcgGluAspAlaGlyThcTyrTrpCys 

- 1270 1290 1310 

• • • • • , 
GAGGTCTTCAACCCAATCAGTAAGAACCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
GluValPheAsnP roll eSerLysAsnGlnSecAspP roll eMetLeuAsnValAsnTyr 

1330 1350 1370 

• • • a . . 

AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
AsnAlaLeuProGlnGluAsnGlyLeuSerProGlyAlalleAlaGlylleVallleGly 

1390 1410 1430 

• • • • . « 
GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTCCATTTCGGGAAG 
ValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

• • • . . , 
ACCGGCAGGGCAAGCGACCAGCGTGATCTCACAGAGCACAAACCCTCAGTCTCCAACCAC 
ThrGlyArgAlaSerAspGlnArgAspLeuThcGluHisLysProSerValSerAsnHis 

1510 1530 1550 

ACTCAGGACCACTC.CAATGACCCACCTAACAAGATGAATGAACTTACTTATTCTACCCTG 
ThrGlnAspHi sSe cAsnAspProProAsnLysMetAsnGluValThcTy rSe rThcLeu 

1570 1590 1610 



on 
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AACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCCCTAACAGCCACA 
AsnPheGluAlaGlrvGlnProThrGlnProThrSerAlaSe rProSe cLeuThrAlaThr 



1630 1650 1670 

GAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCTGTCCTGCTCACTGCAGTGC 
GluIlelleTyrSerGluValLysLysGln 

1690 1710 1730 

TGATGTATTTCAAGTCTCTCACCCTCATCACTAGGAGATTCCTTTCCCCTGTAGGGTAGA 

1750 1770 1790 

GGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGGCATCTCCAGGCTG 

1810 1830 1850 

•••••• 

CCTGGTCACTGCCCCTCTCTCAGTGTCAATAGATGAAAGTACATTGGGAGTCTGTAGGAA 

1870 1890 1910 

• ••••• 
ACCCAACCTTCTTGTCATTGAAATTTGGCAAAGCTGACTTTGGGAAAGAGGGACCAGAAC 

1930 1950 1970 

• ■ • • • • • 

TTCCCCTCCCTTCCCCTTTTCCCAACCTGGACTTGTTTTAAACTTGCCTGTTCAGAGCAC 

1990 2010 2030 

TCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTAATTCGGATTTGCCATAGCCTTG 

2050 2070 2090 

• * • • • • 

AGGTTATGTCCTTTTCCATTAAGTACATGTGCCAGGAAACAGCGAGAGAGAGAAAGTAAA 
2110 .. 2130 2150 

• • • • • • 

CGGCAGTAATCCTTCTCCTATTTCTCCAAAGCCTTGTGTGAACTAGCAAAGAGAACAAAA 
2170 2190 2210 

• • • • • • 

TCAAATATATAACCAATAGTGAAATGCCACAGGTTTGTCCACTGTCAGGGTTGTCTACCT 
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2230 2250 2270 

GTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTAATCCTTCTGGCA 

2290 2310 2330 

. . • • • 

AGCCTCTCTTCAGACAACCCACTAGAAGCAACTAGGAAAAATCACTTGCCAAAATCCAAG 

2350 2370 2390 

GCAATTCCTGATGGAAAATGCAAAAGCACATATATGTTTTAATATCTTTATGGGCTCTGT 

2410 2430 2450 

. , • • • • 

TCAAGGCAGTGCTGAGAGGGAGGGGTTATAGCTTCAGGAGGGAACCAGCTTCTGATAAAC 
2470 2490 2510 

• • • • . • • • 

ACAATCTGCTAGGAACTTGGGAAAGGAATCAGAGAGCTgCtCTTCAGCGATTATTTAAAT 

2530 2550 2570 

TGTTAAACAATACACAATTTGGGGTATTGGGATTTTTCTCCTTTTCTCTGAGACATTCCA 

2590 2610 2630 

• • • • • 

CCATTTTAATTTTTGTAACTGCTTATTTATGTGAAAAGGGTTATTTTTACTTAGCTTAGC 

2650 2670 2690 

• • . . • • 

TATGTCAGCCAATCCGATTGCCTTAGGTGAAAGAAACCACCGAAATCCCTCAGGTCCCTT 
2710 2730 2750 

. • • • • 

GGTCAGGAGCCTCTCAAGATTTTTTTTGTCAGAGGCTCCAAATAGAAAATAAGAAAAGGT 

2770 2790 2810 

• . • • • . • • 

TTTCTTCATTCATGGCTAGAGCTAGATTTAACTCAGTTTCTAGGCACCTCAGACCAATCA 

2830 2650 2B70 

« • • ■ • • 

TCAACTACCATTCTATTCCATGTTTGCACCTGTGCATTTTCTGTTTCCCCCCATTCACTT 
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2890 2910 2930 

TGTCAGGAAACCTTGGCCTCTGCTAAGGTGTATTTGGTCCITGAGAAGTGGGAGCACCCT 

2550 2970 2990 

ACACGGACACTATCACTCATGCTGGTGGCATTGTTTACAGCTAGAAAGCTGCACTGCTGC 

3010 3030 3050 

TAATGCCCCTTGGGAAATGGGGCTGTGAGGAGGAGGATTATAACTTAGGCCTAGCCTCTT 

3070 3090 3no 

TTAACAGCCTCTGAAATTTATCTTTTCTTCTATGGGGTCTATAAATGTATCTTATAATAA 

3130 3150 3170 

AAAGGAAGGACAGGAGGAAGACAGGCAAATGTACTTCTCACCCAGTCTTCTACACAGATG 

3190 3210 3230 

GAATCTCTTTGGGGCTAAGAGAAAGGTTTTATTCTATATTGCTTACCTGATCTCATGTTA 



3250 3270 3290 

CTCAGGTACCTCTT 



GGCCTAAGAGGCTTTCTCCAGGACGATTAGCTTGGACTTCTCTATA 



3310 3330 3350 

TCAGGGTTTTCTAACCCTGACACGGACTGTGCATACTTTCCCTCATCCATGCTGTGCTCT 

3370 3390 3U0 

CTTATTTAATTTTTCfcTGGCTAAGATCATGTCTGAATTATGTATGAAAATTATTCTATGT 

3430 3450 
TTTTATAATAAAAATAATATATCAGACATCGAAAAAAAAAA 
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(d) 



SO 



10 20 30 

• • • I I 

CC (S6 ((A CAC BCA W CCA ACA 6U ACA SCA SCC CU ACC A&A SCA IIC 06 M C1C 



w 70 eo 10 too 110 

• « « • 11 

AA6 C1C TCT ACA AA6 ASS US ACA SA6 AAS ACA 6CA SAS ACC AtC SGA CCC CCC ICA 

nn 6ty Fr» hp Sir 

130 HO ISO UO 170 

« • I f I I 

SCC CCT CCC ISC ASA tIS CAT SIC CCC US AAS SAS SIC CIS CIC ACA CCC ICA CTI 
Ala ft9 P^^ Cy$ Ar^ Im His Vil fro Up lys Slv Vj| Itu Itu Ihr Ali Sir Uv 

180 1^0 eoo 210 . ?20 230 

« • • I I ' , 

CIA ACC UC ICS AAC CCA CCC ACC ACI SCC AAS CIC ACT ATI SAA ICC* ACS CCA IIC 

Ltv Thr Phi Trp Asa fri ft% thr Thr Alt lyt Iru Thr lit Slu Srr Ihr fr$ phi 

2*0 250 . 2S0 270 280 * 

• I • I I 

AAT STC 6CA SAS S6S AAS SAS STT C1T CIA CIC SCC CAC AAC CIS CCC CAC AAT CBI 

yJi" li^ )V Vx^ ^l^ SP ^ff ^ir. 

2W 300 310 320 330 " 3«* 

< III I , 

AIT SST lAC A6C.IDS lAC AAA G6C CAA ASA CT6 SAT CCC AAC AST CIA All 61A S6A 

Vi? \}J V/ IV iL' !Zf VJ ^'S «»r *W 5" tf« III Vil Sly 

3i0 340 370 380 3W \00 

• III I 4 

lAI STA A1A SSA ACT CAA CAA SCT ACC CCA SS6 CCC 6CA TAC AST 6CI CSA SAS ACA 

^iJ ii' ^J? l!i *l« Sir SW Are Elu Ihr 

'// S/ fj SS Sf j^i' jj sf j» // 4r 4Y 

MO 420 OO UO 430 

•ill! 
ATA lAC CCC AAT 6CA KC CT8 CIS ATC CAS AAC SIC ACC CAS AAT SAC ACA SSA IIC 
"5 ''.7 5" Vil Ihr Sin Am Aw Ur SW fhe 

4; ^ 7/ 7J 7j ;y ;5r 7/ 7/ /i; // /J /J /V lir 

OO 480 OO 300 310 



f 



TAC ACC CIA CAA SIC ATA AAS ICA SAI CIT 616 AAl. SAA SAA SCA ACC SSA CA( ITC ' 

IV V*.^ ti' y» S'i k'S 5ly SU fhf 

*7 X> »tf »/ ?i ff n fg 

320 330 340 330 340 370 



III I 



C£l SIA TAC CCS SAS CIS CCC AAS CCC UC ATC TCC ASC AAC AAC ICC AAC CCC STS 

Kit. Vil lyr fr« 61« ltv ffi lyi frt Srr !li Scr Sir A%ii Atn Sir Asn frt Vil 

/cr /ci ^f/f0 /// 0j Mj ^ /// /y/ ^6 /////^ /✓//// 



OA 
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20 



25 



MO 400 MO 

•till 

SAC itC AA6 BM CCT (1$ BCt MC hit I6t CAA CCI CAC Bt! CAC AAC ACA ACC lAC 

"0 450 M m M 

' < • III 

CIS ICS IG6 6TA AfiT SET CAS ASC CIC CCS SIC AS! CCC ASS CU CAS CIS ICC AAI 
Ifu Ifp 1^p,Vil Ajft gi- y I 5 p ^ ^ . J 

^0 ^'/rm //? m Mno /s/ %/ /i/ /s? //7 /x/ )^'} /i) 

^00 ?|0 720 730 7^0 

• * I • , . , 

SoC AAC ACS ACC CIC ACl CU CIC ASC SIC AAA AGS AAC SAI SCA SGA ICS lAT SAA 

a 5?; )rs Mv )s >;j %} .V; lu })i 

^2-0 740 770 780 7W 600 

' • • I f • 

1ST SAA ATA CA6 AAC CCA 6C6 AST 5CC AAC CSC AST SAC CCA SIC ACC CT6 AAT ETC 

^^i Us hli ^It *JJ Ser All Atfi Arfi Sir Asa Pri. Vil iKr tfu ttn Vjl 

8)0 e?0 B30 8<iO 8^0 

' • • I I 

CIC.IAT S6C CCA SATSGC CCC ACC ATT ICC CCC !CA AAS 6CC AAI lAC CCI CCA SSS 

Iru Tyr Sir frt Asp GIv Pr» Thr lit Srr Pro Srr U\ Ali A%fi Ivf Ant Pri. Au 

870 880 .890 too ' fto 

• » I • . 

CAA AAT CT5 AAC CIC TCC ISC CAC 6CA SCC ICT AAC CCA CCI SCA CAS lAC TCI IGG 

IV- jjt? 'jji 2ui55 f-ivSsjJ^iaViiiii^ jiy^'; Ji: ij: 'jj, 

«o no f^o no no no. 

• If! II 

TTT ATC AAT SGS ACS TIC CAG CAA TCC ACA CAA GAS dC III AlC CCC AAC AlC ACT 
^ 111 S'T ^1*^ SI') Thr Gin Glu Uu Pht ill Prb Ain Ht Thr 

35 980 no . 1000 loto io?o 

• « I I • 

GT6 AAT AAT AGC GGA TCC TAT AI6 TSC CAA GCC CAT AAC ICA SCC ACT SSC CIC AAI 
« !rJ SiD Jji Cri SI5 All His AsR Ser Ali Ihf Sir U« Asit 

40 1030 10^0 1050 1040 1070 1060 

• « III I 

ASG-ACC ACA STC ACS ATS ATC ACA ETC TCT GGA AST ECT CCT SIC CTC TCA SCI SIS 

45 '^^^ '^00 . ino 1120 ino ii\o. 

• « . • • I I 

GCC ACC GIC 6GC ATC ACS ATT GSA SIS CT5 GCC AS6 SIS SCI CIS ATA TAG CAG CCC 
All Thr Vil Glr lit Thr Mt Sly Vil leu Ala Arp V|| All ItB ll» — 

50 



30 



55 
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1150 1110 1170 1110 tltO 

I I I I I , 

US T6T Air nC M ATT TCA GSA A6A C16 SCA 6AI U6 ACC AU CCC ISA Ml C1T 

UOO 1210 1220 1230 i2\0 12S0 

• I I I ' I I 

CIA SCI CCT CCA A1C CCA HI 1A1 CCC A16 SAA CCA CIA AAA ACA ASS Ul SCI CIS 



\2iO 
f 



•270 
I 



12S0 
I 



1290 



1300 



1310 



CTC CIS AAS CCC TAT A1E CIS SAS ATS EAC AAC TCA AI6 AAA kU lAA ASS AAA AfcC 



1320 1330 Ip^O 1350 1360 1370 

I III I I 

CCT CAS sec TSA SST SYS ISC CAC ICA SAS ACT TCA CCT AAC IAS ASA CAS SCA AAC 



1360 1390 MOO H10 tWO 

III II 
ISC AAA CCA nnC CTC TIT CSC TTS SCA SSA ISA TSS TST CAI IAS TAT IIC ACA ASA 



too 

I 



lUO 
I 



U50 



USO 



too 



ASI ADC TIC ASA CSS lAA CU AAC ASA S1A KA SA1 CU 1CI 1S1 CAA KC CftA CST 



1^90. 1500 ISIO 1S20 1S30 15^0 

I I I I i I . 

in ACA lAA AAI AAS CCA ICC HI A6I SCA CCC ASI SAC ISA CAT US CAS CAI CU 

1550 ISSO \m \W 1590 

( I I I I 

* TAA CAC ASC CST SIS IIC AAS TBI ACA SIS SIC CM IIC ASA Sit SSn aM ACI CCA 



l&OO ISIO U20 U30 UVO U50 

■ I I I I I 

ACT SAA ATS TTA ASS AAS AAS ATA SAI CCA AIT AAA AAA AAI lAA AAC CAA 111 AAA 



U&O U70 ueO U90 1700 1710 

• I I, . • I I. 

AAA AAA AAA SAA CAC ASS ASA TIC CAS TCT ACT TSA STI ASC ATA ATA CAS AAS ICC 



1720 1730 1710 1750 17S0 

« t . I • I 

CCT CIA CTT TAA CTT TTA CAA AAA AST AAC CIS AAC TAA TCT CAT STI AAC CAA TSI 
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10 



75 



20 



ntO ifoo 1810 i8;o 

• • • 

All UT UK Kl SBT Id 611 ICC lU IIC Cftfi III 6AC AAA ACC CAC U! Id Id 



IBSO 1S60 1870 1880 

' ' < I I . 

All 6TA 116 CCC A66 S66 ACC TAI CAC Id Ad ISI A(A GI6 d( CU CII lAA %U 



1900 iflo \m ino m^o 
• • • III 

Wl AAA ICA CAA AIA AAA EC^ AAI US dC lAT AAC TAA AAA AAA AAA AAA AAA AAA 



ino 

A;A &AA AAA AAA AAA AAA AAA AAA 



A schematic relationship of the transmembrane CEA's. namely TM-1 (CEA-(c)). TM-2 (CEA-(e)), TM-3 
(CEA-(f)) and TM-4 (CEA-(g)) is depicted in Fig. 1: 

Assuming TM-1 is composed of five sections as depicted in Fig. 1, namely 10, 12, 14, 16 and 18, TM-2 
differs from TM-1 in that the 100 amino acid (100 AA) section 14 is deleted and at splice point 20 betweeen 
25 sections 12 and 16, surprisingly an extra amino acid, namely Asp occurs. 

TM-3 is the same as TM-1 except that section 18 is truncated at splice point 22, I.e.. a section of 70 
amino acids is deleted and results in a new section made up of subsections 24 + 26, Surprisingly, 
however, six new amino acids (section 26) occur. Another example of formation of a novel amino acid 
sequence resulting from a deletion of nucleic acid sequence is for platelet derived growth factor-A. 
30 TM-4 is the same as TM-2 up until the end of subsection 24. 

Subsection 24 is contained in section 18 of TM-1 and TM-2, but is not depicted in Fig. 1 for TM-1 and 
TM-2. 

Some CEA epitopes are unique. These are the epitopes which have been useful for distinguishing the 
various CEA-like antigens immunologically. Peptide epitopes are defined by the linear amino acid sequence 

35 of the antigen and/or features resulting from protein folding. The information required for protein folding is 
encoded in the primary amino acid sequence. Therefore, antigenic differences ultimately result from 
differences in the primary structure of the different CEA molecules. The differences residing In the CEA 
protein in the CEA species can thus be determined by determining the primary amino acid sequences. This 
can be most readily accomplished by cloning and sequencing each of the genes for CEA. To determine 

40 which gene products will be most useful for cancer diagnosis, unique probes can be selected for each gene 
and expression of each gene can be determined in different tumor types by nucleic acid hybridization 
techniques. The present invention provides a tool with which to identify potential genes coding for different 
members of the CEA family and to determine the theoretical primary amino acid sequences for them. Using 
the method of automated peptide synthesis, peptides can then be synthesized corresponding to unique 

45 sequences in these antigens. With these peptides, antibodies to these sequences can be produced which, 
in the intact CEA molecule, might not be recognized by the animal being immunized. Having accomplished 
this, advantage can then be taken of the differences In these antigens to generate specific immunoassays 
for the measurement of each antigen. 

A wide variety of host/cloning vehicle combinations may be employed in cloning the double-stranded 

50 nucleic acid prepared in accordance with this invention. For example, useful cloning vehicles may consist of 
segments of chromosomal, non-chromosomal and synthetic DNA sequences, such as various known 
derivatives of SV40 and known bacterial plasmids, e.g.. plasmids from E. coli including col E1, pCRI, 
pBR322, pMB89 and their derivatives, wider host range plasmids, e.g., RP4. and phage DNAs, e.g., the 
numerous derivatives of phage, e.g.. NM989, and other DNA phages, e.g.. Ml 3 and Filamenteous single- 

55 stranded DNA phages and vectors derived from combinations of plasmids and phage DNAs such as 
plasmids which have been modified to employ phage DNA or other expression control sequences or yeast 
plasmids such as the 2 u plasmid or derivatives thereof. Useful hosts may include bacterial hosts such as 
strains of E. coli, such as E. coli HB 101. E. coli XI 776, E. coli X2282, E. coli MRCI and strains of 
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Pseudomonas, Bacillus subtilis, Bacillus stearothermophilus and other E. coli , bacilli, yeasts and other fungi, 
animal or plant hosts such as animal (including human) or plant cells In culture or other hosts. Of course, 
not all host/vector combinations may be equally efficient. The particular selection of host/cloning vehicle 
combination may be made by those of skill in the art after due consideration of the principles sat forth 

5 without departing from the scope of this invention. 

Furthermore, within each specific cloning vehicle, various sites may be selected for insertion of the 
nucleic acid according to the present invention. These sites are usually designated by the restriction 
endonuclease which cuts them. For example, in pBR322 the PstI site is located in the gene for beta- 
lactamase, between the nucleotide triplets that code for amino acids 181 and 182 of that protein. One of the 

10 two Hindu endonuclease recognition sites is between the triplets coding for amino acids 101 and 102 and 
one of the several Taq sites at the triplet coding for amino acid 45 of beta-lactamase in pBR322. In similar 
fashion, the EcoRI site and the PVUII site in this plasmid lie outside of any coding region, the EcoRI site 
being located between the genes coding for resistance to tetracycline and ampicillin, respectively. These 
sites are well recognized by those of skill in the art. It is, of course, to be understood that a cloning vehicle 

75 useful in this invention need not have a restriction endonuclease site for insertion of the chosen DNA 
fragment. Instead, the vehicle could be cut and joined to the fragment by alternative means. 

The vector or cloning vehicle and in particular the site chosen therein for attachment of a selected 
nucleic acid fragment to form a recombinant nucleic acid molecule is determined by a variety of factors, 
e.g., the number of sites susceptible to a particular restriction enzyme, the size of the protein to be 

20 expressed, the susceptibility of the desired protein to proteolytic degradation by host cell enzymes, the 
contamination of the protein to be expressed by host cell proteins difficult to remove during purification, the 
expression characteristics, such as the location of start and stop codons relative to the vector sequences, 
and other factors recognized by those of skill in the art. The choice of a vector and an insertion site for a 
particular gene is determined by a balance of these factors, not all sections being equally effective for a 

25 given case. 

Methods of inserting nucleic acid sequences into cloning vehicles to form recombinant nucleic acid 
molecules include, for example, dA-dT tailing, direct ligation, synthetic linkers, exonuclease and 
polymerase-linked repair reactions followed by ligation, or extension of the nucleic acid strand with an 
appropriate polymerase and an appropriate single-stranded template followed by ligation. 

30 It should also be understood that the nucleotide sequences or nucleic acid fragments inserted at the 
selected site of the cloning vehicle may include nucleotides which are not part of the actual structural gene 
for the desired polypeptide or mature protein or may include only a fragment of the complete structural 
gene for the desired protein or mature protein. 

The cloning vehicle or vector containing the foreign gene is employed to transform an appropriate host 

35 so as to permit that host to replicate the foreign gene and to express the protein coded by the foreign gene 
or portion thereof. The selection of an appropriate host is also controlled by a number of factors recognized 
by the art. These include, for example, the compatibility with the chosen vector, the toxicity of proteins 
encoded by the hybrid plasmid. the ease of recovery of the desired protein, the expression characteristics, 
biosafety and costs. A balance of these factors must be struck with the understanding that not all hosts may 

40 be equally effective for expression of a particular recombinant DNA molecule. 

The level of production of a protein is governed by two major factors: the number of copies of its gene 
within the cell and the efficiency with which those gene copies are transcribed and translated. Efficiency of 
transcription and translation (which together comprise expression) is in turn dependent upon nucleotide 
sequences, normally situated ahead of the desired coding sequence. These nucleotide sequences or 

45 expression control sequences define Inter alia, the location at which RNA polymerase interacts to initiate 
transcription (the promoter sequence) and aTwhich ribosomes bind and interact with the mRNA (the product 
of transcription) to initiate translation. Not all such expression control sequences function with equal 
efficiency. It is thus of advantage to separate the specific coding sequences for the desired protein from 
their adjacent nucleotide sequences and fuse them instead to other known expression control sequences so 

50 as to favor higher levels of expression. This having been achieved, the newly engineered nucleic acid, e.g.. 
DNA, fragment may be inserted into a multicopy plasmid or a bacteriophage derivative in order to increase 
the number of gene copies within the cell and thereby further improve the yield of expressed protein. 

Several expression control sequences may be employed as described above. These include the 
operator, promoter and ribosome binding and interaction sequences (Including sequences such as the 

65 Shine-Dalgarno sequences) of the lactose operon of E. coli ("the lac system"), the corresponding 
sequences of the tryptophan synthetase system of E. coli ("the trp system"), the major operator and 
promoter regions of phage X (OlPl and OrP'r). the control region of Filamenteous single-stranded DNA 
phages, or other sequences which control the expression of genes of prokaryotic or eukaryotic cells and 
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their viruses. Therefore, to improve the production of a particular polypeptide in an appropriate host, the 
gene coding for that polypeptide may be selected and removed from a recombinant nucleic acid molecule 
containing it and reinserted into a recombinant nucleic acid molecule closer or in a more appropriate 
relationship to its former expression control sequence or under the control of one of the above described 

5 expression control sequences. Such methods are known in the art. 

As used herein "relationship" may encompass many factors, e.g., the distance separating the expres- 
sion enhancing and promoting regions of the recombinant nucleic acid molecule and the inserted nucleic 
acid sequence, the transcription and translation characteristics of the inserted nucleic acid sequence or 
other sequences in the vector itself, the particular nucleotide sequence of the inserted nucleic acid 

10 sequence and other sequences of the vector and the particular characteristics of the expression enhancing 
and promoting regions of the vector. 

Further increases in the cellular yield of the desired products depend upon an increase in the number 
of genes that can be utilized in the cell. This is achieved, for illustration purposes, by insertion of 
recombinant nucleic acid molecules engineered into the temperate bacteriophage X (NM989). most simply 

15 by digestion of the plasmid with a restriction enzyme, to give a linear molecule which is then mixed with a 
restricted phage X cloning vehicle (e.g., of the type described by N. E. Murray et al, "Lambdoid Phages 
That Simplify the Recovery of In Vitro Recombinants", Molec. Gen. Genet., 150, pp. 53-61 (1977) and N. E. 
Murray et al, "Molecular Cloning of the DNA LIgase Gene From Bacterioph'age T4", J, MoL Biol. , 132, pp. 
493-505 (1979)) and the recombinant DNA molecule recircularized by incubation with DNA ligase. The 

20 desired recombinant phage is then selected as before and used to lysogenize a host strain of E. coll. 

Particularly useful X cloning vehicles contain a temperature-sensitive mutation in the repression gene cl 
and suppressible mutations in gene S, the product of which is necessary for lysis of the host cell, and gene 
E. the product of which is major capsid protein of the virus. With this system, the lysogenic cells are grown 
at 32 and then heated to 45 'C to induce excision of the prophage. Prolonged growth at 37'C leads to 

26 high levels of production of the protein, which is retained within the cells, since these are not lysed by 
phage gene products in the normal way, and since the phage gene insert is not encapsulated it remains 
available for further transcription. Artificial lysis of the cells then releases the desired product in high yield. 

In addition, it should be understood that the yield of polypeptides prepared in accordance with this 
invention may also be Improved by substituting different codons for some or all of the codons of the present 

30 DNA sequences, these substituted codons coding for amino acids Identical to those coded for by the 
codons replaced. 

Finally, the activity of the polypeptides produced by the recombinant nucleic acid molecules of this 
invention may be improved by fragmenting, modifying or derivatizing the nucleic acid sequences or 
polypeptides of this invention by well-known means, without departing from the scope of this invention. 
35 The polypeptides of the present invention include the following: 

(1) the polypeptides expressed by the above described cells, 

(2) polypeptides prepared by synthetic means, 

(3) fragments of polypeptides (1) or (2) above, such fragments produced by synthesis of amino acids or 
by digestion or cleavage. 

40 Regarding the synthetic peptides according to the invention, chemical synthesis of peptides is 
described in the following publications: S.B.H. Kent, Biomedical Polymers, eds. Goldberg, E.P. and 
Nakajima, A. (Academic Press, New York), 213-242, (1980); A.R. Mitchell, S.B.H. Kent, M. Engelhard and 
R.B. Merrlfield, J. Org. Chem. . 43. 2845-2852, (1978): J.P. Tarn, T.-W. Wong, M. Riemen, F.-S. Tjoeng and 
R.B. Merrifieid, Tet. Letters ,"^033"-4036. (1979); S. Mojsov. A.R. Mitchell and R.B. Merrifield, J. Org. Chem. , 

45 45, 555-560, (1980); J.P. Tam, R.D. DiMarchi and R.B. Merrifield. Tet. Letters . 2851-2854, (1981); and 
STB.H. Kent, M. Riemen, M. Le Doux and R.B. Merrifield, Proceedings of the IV International Symposium on 
Methods of Protein Sequence Analysis, (Brookhaven Press, Brookhaven, NY), in press, 1981. 

In the Merrifield solid phase procedure, the appropriate sequence of L-amino acids is built up from the 
carboxyl terminal amino acid to the amino terminal amino acid. Starting with the appropriate carboxyl 

50 terminal amino acid attached to a polystyrene (or other appropriate) resin via chemical linkage to a 
chloromethyl group, benzhydrylamine group, or other reactive group of the resin, amino acids are added 
one by one using the following procedure. The peptide-resin is: 

(a) washed with methylene chloride; 

(b) neutralized by making for 10 minutes at room temperature with 5% (v/v) diisopropylethylamine (or 
55 other hindered base) In methylene chloride; 

(c) washed with methylene chloride; 

(d) an amount of amino acid equal to six times the molar amount of the growing peptide chain is 
activated by combining it with one-half as many moles of a carbodilmide (e.g., dicyclohexylcarbodiimide, 
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or diisopropylcarbodiimide) for ten minutes at 0 • C, to form the symmetric anhydride of the amino acid. 
The amino acid used should be provided originally as the N-alpha-tert.-butyloxycarbonyl derivative, with 
side chains protected with benzyl esters (e.g., aspartic or glutamic acids), benzyl ethers (e.g., serine, 
threonine, cysteine or tyrosine), benzyloxycarbonyl groups (e.g., lysine) or other protecting groups 
commonly used in peptide synthesis; 

(e) the activated amino acid is reacted with the peptide-resin for two hours at room temperature, resulting 
in addition of the new amino acid to the end of the growing peptide chain; 

(f) the peptide-resin Is washed with methylene chloride; 

(g) the N-alpha-(tert.-butyloxycarbonyl) group is removed from the most recently added amino acid by 
reacting with 30 to 65%. preferably 50% (v/v) trifluoroacetic acid in methylene chloride for 10 to 30 
minutes at room temperature; 

(h) the peptide-resin is washed with methylene chloride; 

(i) steps (a) through (h) are repeated until the required peptide sequence has been constructed. 

The peptide is then removed from the resin and simultaneously the side-chain protecting groups are 
removed, by reaction with anhydrous hydrofluoric acid containing 10% v/v of anisole or other suitable 
(aromatic) scavenger. Subsequently, the peptide can be purified by gel filtration, ion exchange, high 
pressure liquid chromatography, or other suitable means. 

In some cases, chemical synthesis can be carried out without the solid phase resin, in which case the 
synthetic reactions are performed entirely in solution. The reactions are similar and well known in the art, 
and the final product is essentially identical. 

Digestion of the polypeptide can be accomplished by using proteolytic enzymes, especially those 
enzymes whose substrate specificity results in cleavage of the polypeptide at sites immediately adjacent to 
the desired sequence of amino acids. 

Cleavage of the polypeptide can be accomplished by chemical means. Particular bonds between amino 
acids can be cleaved by reaction with specific reagents. Examples include the following: bonds involving 
methionine are cleaved by cyanogen bromide; asparaginyl-glycine bonds are cleaved by hydroxylamine. 

The present invention has the following advantages: 

(1) The nucleic acids coding for TM-1, TM-2 and TM-3 can be used as probes to isolate other members 
of the CEA gene family. 

(2) The nucleic acids coding for TM-1 . TM-2 and TM-3 can be used to derive oligonucleotide probes to 
determine the expression of TM-1 , TM-2. TM-3 and other CEA genes in various tumor types. 

(3) TM-1 , TM-2, TM-3 and TM-4 nucleotide sequences can be used to predict the primary amino acid 
sequence of the protein for production of synthetic peptides, 

(4) Synthetic peptides derived from the above sequences can be used to produce sequence-specific 
antibodies. 

(5) Immunoassays for each member of the CEA antigen family can be produced with these sequence- 
specific antibodies and synthetic peptides. 

(6) The aforementioned immunoassays can be used as diagnostics for different types of cancer if it is 
determined that different members of the CEA family are clinically useful markers for different types of 

cancer. 

Peptides according to the present invention can be labelled by conventional means using radioactive 
moieties (e.g., ^^1). enzymes, dyed and fluorescent compounds, just to name a few. 

Several possible configurations for immunoassays according to the present invention can be used. The 
readout systems capable of being employed in these assays are numerous and non-limiting examples of 
such systems include fluorescent and colorimetric enzyme systems, radioisotopic labelling and detection 
and chemiluminescent systems. Two examples of immunoassay methods are as follows: 

(1) An enzyme linked immunoassay (ELISA) using an antibody preparation according to the present 
invention (including Fab or F(ab)' fragments derived therefrom) to a solid phase (such as a microtiter 
plate or latex beads) is attached a purified antibody of a specificity other than that which is conjugated to 
the enzyme. This solid phase antibody is contacted with the sample containing CEA antigen family 
members. After washing, the solid phase antibody-antigen complex is contacted with the conjugated anti- 
peptide antibody (or conjugated fragment). After washing away unbound conjugate, color or fluorescence 
is developed by adding a chromogenic or fluorogenic substrate for the enzyme. The amount of color or 
fluorescence developed is proportional to the amount of antigen in the sample. 

(2) A competitive fluorometric immunoassay using fluorescently labelled peptide or synthetic peptides of 
the sequences for TM-2. TM-2, TM-3 and TM-4. In this example, the purified peptide expressed by cells 
or synthetic peptides thereof are fluorescently labelled. To a solid phase is attached a purified antibody. 
This solid phase is then contacted with sample containing CEA antigen family members to which has 
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been added fluorescent peptide probe. After binding, excess probe Is washed away the annount of bound 
probe is quantitated. The amount of bound fluorescent probe will be inversely proportional to the amount 
of antigen in the sample. 

In the nucleic acid hybridization method according to the present invention, the nucleic acid probe is 
5 conjugated with a label, for example, an enzyme, a fluorophore. a radioisotope, a chemiluminescent 
compound, etc. In the most general case, the probe would be contacted with the sample and the presence 
of any hybridizable nucleic acid sequence would be detected by developing in the presence of a 
chromogenic enzyme substrate, detection of the fluorophore by epifluorescence. by autoradiography of the 
radioisotopically labelled probe or by chemiluminescence. The detection of hybridizable RNA sequences 
70 can be accomplished by (1) a dot blot methodology or (2) an in situ hybridization methodology. Methods for 
these last two techniques are described by D. Gillespie and J. Bresser. "mRNA Immobilization in Nal: Quick 
Blots", Biotechniques, 184-192, November/December 1983 and J. Lawrence and R. Singer, "Intracellular 
Localization of Messenger RNAs for Cytosketal Proteins". Cell. 45, 407-415, May 9, 1986, respectively. The 
readout systems can be the same as described above, e.g., enzyme labelling, radiolabelling, etc. 
IS As stated above, the Invention also relates to the use in medicine of the aforementioned complex of the 
invention. 

The invention further provides a pharmaceutical composition containing as an active ingredient a 
complex of the invention in the form of a sterile and/or physiologically isotonic aqueous solution. 

For parenteral administration, solutions and emulsions containing as an active ingredient the complex of 
20 the invention should be sterile and. if appropriate, blood-isotonlc. 

It is envisaged that the active complex will be administered perorally, parenterally (for example, 
intramuscularly, Intraperitoneally, or Intravenously), rectally or locally. 

Example 1: Preparation of cDNA in pcE22 which codes for TM2-CEA [CEA-(e)l 

25 

Example la: RNA Preparation 

Messenger RNA was prepared by the proteinase K extraction method of J. Favolaro, R. Trelsman and 
R. Kamen, Methods in Enzymology, 65, 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 

30 chromatography to yield poly A+ RNA (3'-polyadenylated eukaryotic RNA containing most mRNA se- 
quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3 x 10» cells of transfectant 23.411 (ATCC No. CRL 9731, deposited with the ATCC on June 
1, 1988), that expresses TM-1, TM-2, TM-3 and TM-4, Kamarck et al, Proc. Natl. Acad. Sci., USA. 84, 5350- 
5354, August 1987, were harvested from roller bottles after late logarithmic growth. Cells wereTysed by 

35 homogenization in an ice-cold solution of 140 mM NaCI, 1.5 mM MgCfe, 10 mM Tris-HCI. pH 8.0, 0.5% 
NP40®, 4 mM dithiothreitol and 20 units of placental ribonuclease inhibitor/ml. Sodium deoxycholate was 
then added to 0.2%. Cytoplasm and nuclei were separated by centrifugation of the homogenate at 12,000xg 
for 20 minutes. The cytoplasmic fraction was mixed with an equal volume of 0.2 M Tris-HCI, pH 7.8. 25 mM 
EDTA, 0.3 M NaCI, 2% sodium dodecyl sulfate and 400 ug/ml of proteinase K, incubated for 1 hour at 

40 37 •C, then extracted once with an equal volume of phenol/cholorform (1:1 /v:v) solution. Nucleic acids were 
obtained by ethanol precipitation of the separated aqueous phase. Total RNA was enriched by passage in 
0.5 M NaCI, 10 mM Tris-HCI, pH 7.8, 0.1% sarcosyl® through an oligo dT(12-18) cellulose column. After 
washing, bound RNA was eluted in the same solution without sodium chloride. 

45 Example lb: Reverse Transcription of mRNA 

Ten micrograms of poly A+ RNA were primed for reverse transcription with oligo dT(12-18) and pdNs 
primers. One hundred microliter reaction was performed for 4 hours at 42'C with 200 units AMV reverse 
transcriptase (Life Science, Inc. St. Petersburg, Florida, U.S.A.). The RNA component of the cDNA/mRNA 
50 hybrids was replaced with the second complementary strand by treatment with RNase H, E. coli DNA 
polymerase I and E. coli DNA ligase at 12 'C and 22 'C for 1.5 hours each. Molecular ends were~polished 
by treatment with~ tT'DNA polymerase. cDNA was phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" Spun column prepared in 10 mM Tris-HCI, pH 7.8, 1 mM EDTA (TE). 

55 Example 1c: Cloning of pcE22 (plasmid cDNA E22) 

Synthetic DNA linkers 5' pCCCGGG 3' 

3' GGGCCCTTAA 5' 
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were attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers were 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE, and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A+ RNA of the 23.411 cell line, the size 
of the CEA-related mRNA was estimated at 3.6 kb. Therefore, cDNA fragments between 2 and 4 kb were 
5 recovered from gel slices and fragments were ethanol precipitated. After resuspension of cDNA in TE. 
EcoRI-cleaved lambda gt10 arms were added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeded at 7 • C for 2 days in the presence of T4 DNA ligase. Aliquots of the ligation reaction were added 
to commercially-obtained packaging mix (Stratagene. San Diego. California, U.S.A.). Five million phage 
particles were obtained ofter in vitro packaging and infection of E. coli host NM514. 

10 

Example Id : Screening of Recombinant Library 

Five hundred thousand packaged lambda particles were plated on lawns of E. coli NM514 and replicate 
patterns were lifted onto nitrocellulose sheets as described by W.D. Benton and RA/V. Davis, Science 196, 
75 180-182, (1977). Positive phage were selected by hybridization with ^zp-iabeled LV7 cDNA insert probelhat 
contained a domain repeated amoung various CEA family members. By multiple rounds of screening. 
Phage from individual plaques were amplified and titered, and these were used to prepare small quantities 
of recombinant phage DNA. 

20 Example 1e : DNA Manipulation 

Phage DNA was prepared according to T. Maniatis, E. Fritsch and J. Sambrook. Molecular Cloning. A 
Laboratory Manual, Cold Spring Habor, (1982). DNA segments were isolated from low melting agarose geli 
and inserted for subcloning into Bluescript plasmid vectors (Stratagene, San Diego, California, U.S.A.). DNA 
25 sequencing was performed by the dideoxy termination method of F. Sanger, S. Nicklen and A. Coulson, 
Proc. Natl. Acad. Sci.. U.S.A., 74. 5463-5467, (1977). The nucleic acid and translated sequence for cDNA in 
pcE22 is given hereinabove (TM'-2 (CEA-(e)). 

Example 2 : Preparation of cDNA in pcHT-6 which Partically Codes for TM3-CEA [CEA'(f)] 

30 

Example 2a: RNA Preparation 

Messenger RNA was prepared by the proteinase K extraction method of J. Favolaro, R. Treisman and 
R. Kamen, Methods in Enzymology, 65 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 

35 chromatography to yield poly A+ rTJa (3'-polyadenylated eukaryotic RNA containing most mRNA se- 
quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3x10^ cells of HT-29 tumor cells (ATCC HTB38) were harvested form roller bottles after late 
logarithmic growth. Cells were lysed by homogenization in an ice-cold solution of 140 mM NaCl, 1.5 mM 
MgCb, 10 mM Tris-HCI, pH 8.0. 0.5% NP40®, 4 mM dithiothreitol and 20 units of placental ribonuclease 

40 inhibitor/ml. Sodium deoxycholate was then added to 0.2%. Cytoplasm and nuclei were separated by 
centrifugation of the homogenate at 12,000xg for 20 minutes. The cytoplasmic fraction was mixed with an 
equal volume of 0.2 M Tris-Hcl, pH 7.8, 25 mM EDTA. 0.3 M NaCI, 2% sodium dodecyl sulfate and 400 
Utg/ml of proteinase K, incubated for 1 hour at 37 'C, then extracted once with an equal volume of 
phenol/cholorform (1:1/v:v) solution. Nucleic acids were obtained by ethanol precipitation of the separated 

45 aqueous phase. Total RNA was enriched by passage in 0.5 M NaCI. 10 mM TrIs-HCI. pH 7.8, 0.1% 
sarcosyl® through an oligo dT(12-18) cellulose column. After washing, bound RNA was eluted In the same 
solution without sodium chloride. 

Example 2b : Reverse Transcription of mRNA 

60 

Ten micrograms of HT-29 poly A+ RNA were primed for reverse transcription with oligo dT(12-18) and 
pdNB primers. One hundred microliter reaction was performed for 4 hours at 42 •C with 200 units AMV 
reverse transcriptase (Life Science, Inc. St. Petersburg. Florida, U.S.A.). The RNA component of the 
cDNA/mRNA hybrids was replaced with the second complementary strand by treatment with RNase H. E. 
55 coli DNA polymerase I and E. coli DNA ligase at 12*C and 22 *C for 1.5 hours each. Molecular ends were 
polished by treatment with f4 DNA polymerase. cDNA was phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" spun column prepared in 10 mM Tris-HCI, pH 7.8. 1 mM EDTA (TE). 



AO 
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Example 2c : Cloning of pcHT-6 (piasmid cDNA HT-6) 

Synthetic DNA linkers 5' pCCCGGG 3' 

3* GGGCCCTTAA 5* 

5 were attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers were 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE, and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A+ RNA of the HT-29 cell line, the size of 
the CEA-related mRNA was estimated at 2.2 kb. Therefore. cDNA fragments between 2 and 3 kb were 
recovered from gel slices and fragments were ethanol precipitated. After resuspension of cDNA in TE, 

10 EcoRI-cleaved lambda gtIO arms were added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeded at 7 * C for 2 days in the presence of T4 DNA ligase. Aiiquots of the ligation reaction were added 
to commercially-obtained packaging mix (Stratagene. San Diego, California, U.S.A.). Five million phage 
particles were obtained ofter in vitro packaging and infection of E. coli host NM514. 

75 Example 2d: Screening of Recombinant Library 



Five hundred thousand packaged lambda particles were plated on lawns of E. coli NM514 and replicate 
patterns were lifted onto nitrocellulose sheets as described by W.D. Benton and'FLW. Davis. Science, 196. 
180-182, (1977). Positive phage were selected by hybridization with 32p-iabeled LV7 cDNA insert probelhat 
20 contained a domain repeated amoung various CEA family members. By multiple rounds of screening. 
Phage from individual plaques were amplified and titered. and these were used to prepare small quantities 
of recombinant phage DNA. 

Example 2e: DNA Manipulation 

25 

Phage DNA was prepared according to T. Maniatis, E. Fritsch and J. Sambrook, Molecular Cloning, A 
Laboratory Manual, Cold Spring Habor, (1982). DNA segments were isolated from low melting agarose geli 
and inserted for subcloning into Bluescript piasmid vectors (Stratagene, San Diego. California, U.S.A.). DNA 
sequencing was performed by the dideoxy termination method of F. Sanger, S. NIcklen and A. Coulson, 

30 Proc. Natl. Acad. Sci., U.S.A., 74, 5463-5467, (1977). The nucleic acid and translated sequence for cDNA in 
HT-6 not complete at the 5' end of its coding region, but the nucleotide sequenece and restriction map of 
the HT-6 insert indicates that it is related to nucleic acid sequences of cDNA clones coding for CEA-(c) and 
CEA-(e). The nucleotide sequence of HT-6 insert differs from these clones at only nucleotide position 1463 
to 1515 and 1676 to 2429 of the CEA-(c) cDNA. It is inferred from this result that the pcHT-6 insert is a 

35 partial coding sequence for CEA-(f) and the presumed nucleic acid and translated sequence of CEA-(f) is 
given hereinabove (TM-3 (CEA-(f)). 

Example 3 : Preparation of cDNA which codes for TM4-CEA [CEA-(g)] 

40 Example 3a: RNA Preparation 

Messenger RNA is prepared by the proteinase K extraction method of J. Favolaro, R. Treisman and R. 
Kamen, Methos in Enzymology, 65, 718, Academic Press, Inc. (1980), followed by oligo dT cellulose 
chromatography to yield poly A+~" RNA (3'-polyadenylated eukaryotic RNA containing most mRNA se- 

45 quences that can be translated into polypeptides). To obtain approximately 100 ug of poly A+ RNA, 
approximately 3 x 10^ cells of transfectant 23.411 or tumor cell line HT-29 (ATCC HTB 38) are harvested 
from roller bottles after late logarithmic growth. Cells are lysed by homogenization in an ice-cold solution of 
140 mM NaCI, 1.5 mM MgCb, 10 mM Tris-HCI. pH 8.0, 0.5% NP40®. 4 mM dithiothreitol and 20 units of 
placental ribonuclease inhibitor/ml. Sodium deoxycholate was then added to 0.2%. Cytoplasm and nuclei 

50 are separated by centrifugation of the homogenate at 12,000xg for 20 minutes. The cytoplasmic fraction is 
mixed with an equal volume of 0.2 M Tris-Hcl, pH 7.8, 25 mM EDTA, 0.3 M NaCI. 2% sodium dodecyl 
sulfate and 400 ug/ml of proteinase K, incubated for 1 hour at 37 *C, then extracted once with an equal 
volume of phenol/cholorform (1:1 /v:v) solution. Nucleic acids are obtained by ethanol precipitation of the 
separated aqueous phase. Total RNA is enriched by passage in 0.5 M NaCI, 10 mM Tris-HCI, pH 7.8, 0,1% 

55 sarcosyl through an oligo dT(12-18) cellulose column. After washing, bound RNA is eluted in the same 
solution without sodium chloride. 
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Example 3b: Reverse Transcription of mRNA 



Ten micrograms of 23.411 or HT 29 poly A+ RNA are primed for reverse transcription with oligo dT(12- 
18) and pdNe primers. One hundred microliter reaction was performed for 4 hours at 42'C with 200 units 
5 AMV reverse transcriptase (Life Science. Inc. St. Petersburg. Florida, U.S.A.). The RNA component of the 
cDNA/mRNA hybrids is replaced with the second complementary strand by treatment with RNase H. E. coli 
DNA polymerase I and E. coli DNA ligase at 12 "C and 22 "C for 1.5 hours each. Molecular ends are 
polished by treatment with f4~DNA polymerase. cDNA is phenol/chloroform extracted and purified over a 
"SEPHADEX® G-50" spun column prepared in 10 mM Tris-HCI. pH 7.8, 1 mM EDTA (TE). 

10 

Example 3c: Cloning of cDNA for CEA"(g) 

Synthetic DNA linkers 5* pCCCGGG 3* 

3' GGGCCCTTAA 5' 

75 are attached to the ends of cDNA by blunt end ligation with excess T4 DNA ligase. Excess linkers are 
removed by chromatography through "SEPHADEX® G-50" (medium) in TE. and by fractionation on 0.8% 
low melting agarose gel. Based on Northern blot analysis of poly A + RNA of the 23.41 1 and HT-29 cell 
lines, the size of the CEA-related mRNA is estimated at 1 .7 kb. Therefore, cDNA fragments between 1 and 
2 kb are recovered from gel slices and fragments are ethanol precipitated. After resuspension of cDNA in 

20 TE, EcoRI-cleaved lambda gtIO arms are added to cDNA at an estimated molar ratio of 1:1. Ligation 
proceeds at 7 • C for 2 days in the presence of T4 DNA ligase. Aliquots of the ligation reaction are added to 
commercially-obtained packaging mix (Stratagene, San Diego. California. U.S.A.). Phage particles are 
obtained after in vitro packaging and infection of E. coli host NM514. 



25 Example 3d: Screening of Recombinant Library 



Five hundred thousand to one million packaged lambda particles are plated on lawns of E. coli NM514 
and replicate patterns are lifted onto nitrocellulose sheets as described by W.D. Benton and R.W. Davis, 
Science, 196, 180-182, (1977). Positive phage are selected by hybridization with ^^p-iabeled LV7 cDNA 
30 insert probe that contained a domain repeated amoung various CEA family members. By this selection 
method, positive phage are obtained after multiple rounds of screening. Phage from individual plaques are 
amplified and titered. and these are used to prepare small quantities of recombinant phage DNA. 

Example 3e: DNA Manipulation 

35 

Phage DNA is prepared according to T. Maniatis. E. Fritsch and J. Sambrook. Molecular Cloning . A 
Laboratory Manual. Cold Spring Harbor, (1982). DNA segments are isolated from low melting agarose gels 
and inserted for subcloning into Bluescript plasmid vectors (Stratagene, San Diego. California, U.S.A.). DNA 
sequencing is performed by the dideoxy termination method of F. Sanger, S. Nicklen and A. Coulson, Proc. 
40 Natl. Acad. Sci., U.S.A.. 74, 5463-5467. (1977). The nucleotide and translated sequence for a cDNA coding 
for CEA-(g) is given hereinabove (TM-4 (CEA-(g)). 

Example 4: Screening of a KG-1 cDNA Library with ^^P-labelled CEA Probe, LV7 (CEA-(A)) 



45 A segment of cDNA coding for a portion of carcinoembryonic antigen [LV7 or CEA-(a)] was radiolabel- 
led by random priming and used to detect homologous sequences on filter replicas of a commercial cDNA 
library prepared from KG-1 cells in bacteriophage vector X gtl 1 (Clontech Laboratories, Inc., Palo Alto. CA.. 
U.S. A.). Hybridizations were performed at 68 'C In 2xSSSPE (IxSSPE - 0.15 M NaCI. 0.01 M sodium 
phosphate and 1 mM EDTA, pH 7), 5x Denhardfs solution and 100 ug of denatured salmon spemn DNA per 

50 ml, and post-hybridization washes were in 0.2xSSC, 0.25% sodium dodecyl sulfate. 

Positive phage were picked, rescreened to homogeneity, and amplified for production of DNA. cDNA 
inserts were excised from phage DNA with EcoRI endonuclease and subcloned into the EcoRI site of the 
plasmid vector pBluescript KS. DNA sequencing on double-stranded DNA was by the method of Sanger et 
al, supra. The sequences of two different inserts from the KG-1 cDNA library are shown below: 

55 
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70 



75 



pcKGCEAl: 

1 acagcacagctgacagccgtactcaggaagcttctggatcctaggcttatctccacagag 60 

61 gagaacacacaagcagcagagaccatggggcccctctcagcccctccctgcacacacctc 120 

Me tGlyProLeuSerAlaProProCysThrHi sLeu 

121 atcacttggaagggggtcctgctcacagcatcacttttaaacttctggaatccgcccaca 18 0 
IleThrTcpLysGlyValLeuLeuThrAlaSerLeuLeuAsnPheTrpAsnProProThr 

181 actgcccaagtcacgattgaagcccagccacccaaagtttctgaggggaaggatgttctt 240 
ThrAlaGlnValThrlleGluAlaGlnProProLysValSerGluGlyLysAspValLeu 

241 ctacttgtccacaatttgccccagaatcttgctggctacatttggtacaaagggcaaatg 300 
LeuLeuValHisAsnLeuProGlnAsnLeuAlaGlyTyrlleTrpTyrLysGlyGlnMet 

301 acatacgtctaccattacattacatcatatgtagtagacgg:.tcaaagaattatatatggg 36 0 
ThrTyrValTyrHisTyrlleThrSerTyrValValAspGlyGlnArgllelletyrGly 

361 cctgcatacagtggaagagaaagagt^tattccaatgcatccctgctgatccagaatgtc 420 
ProAlaTyrSerGlyArgGluArgValTyrSerAsnAlaSerLeuLeulleGlnAsnVal 

421 acgcaggaggatgcaggatcctacaccttacacatcataaagcgacgcgatgggactgga 480 
ThrGlnGluAspAlaGlySerTyrThrLeuHisIlelleLysAcgArgAspGlyThrGly 

481 ggagtaactggacatttcaccttcaccttacacctggagactcccaagccctccatctcc 540 
GlyValThrGlyHisPheThrPhoThrLeuHisLeuGluThrProLysProSerlleSer 

25 541 agcagcaacttaaatcccagggaggccatggaggctgtgatcttaacctgtgatcctgcg 600 
SerSerAsnLeuAsnProArgGluAlaNetGluAlaVallleLeuThrCysAspProAla 

601 actccagccgcaagctaccagtggtggatgaatggtcagagcctccctatgactcacagg 660 
ThrProAlaAlaSerTyrGlnTrpTrpKetAsnGlyGlnSerLeuProNetThrHisArg 

o61 ttgcagctgtccaaaaccaacaggaccctctttatatttggtgtcacaaagtatattgca 720 
LeuGlnLeuSerLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

721 ggaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgacccagtcacc 780 
GlyProTyrGluCysGluIleArgAsnProValSerAlaSerArgSerAspProValThr 

781 ctgaatctcctcccaaagctgtccaagccctacatcacaatcaacaacttaaaccccaga 84 0 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnProArg 

841 gagaataaggatgtcttaaccttcacctgtgaacctaagagtgagaactacacctacatt 900 
GluAsnLysAspValLeuThrPheThrCysGluProLysSerGluAsnTyrThrTyrlle 

^ 901 tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgacccattgaaaac 960 
TcpTrpLeuAsnGlyOlnSerLeuProValSerProArgValLysAcgProIleGluAsn 

961 aggatcctca ttctacccaatgtcacgagaaatgaaacaggaccttatcaatgtgaaa ta 1020 
ArglleLeuIleLeuProAsnValThrArgAsnGluThrGlyProTyrGlnCysGlulle 

45 1021 cgggaccgatatggtggcatccgcagtgacccagtcaccctgaatgtcctctatggtcca 1080 
ArgAspArgTyrGlyGlylleArgSerAspPcoValThrLeuAsnValLeuTyrGlyPro 
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1081 gacctccccagcatttacccttcattcacctattaccgttcaggagaaaacctctacttt 1140 
AspLeuPcoSerlleTyrProSerPheThrTyrTyrArgSerGlyGluAsnLeuTycPhe 

1141 tcctgcttcggtgagtctaaoccacgggcacaatattcttggacaattaatgggaagttt 1200 
SerCysPheGlyGluSerAsnProArgAlaGlnTyrSerTrpThcIleAsnGlyLysPhe 

1201 cagctatcaggacaaaagctctctatcccccaaataactacaaagcatagtgggctctat 1260 
GlnLeuSerGlyGlnLysLeuSerlleProGlnlleThrThrLysHisSerGlyLeuTyr 

1261 gcttgctctgtfccgtaactcagccactggcaaggaaagctccaaatccatcacagtcaaa 1320 
AlaCysSerValArgAsnSerAlaThrGlyLysGluSerSerLysSerlleThryalLys 

1321 gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccatg 1380 
ValSerAspTrpZleLeuProEnd 



1-^1 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 1440 

1 ^ ^ 1 tcgatgtaaatttcatgggaaaacccttg tacctgacatg tgagccactcagaac tcacc 1500 

1501 aaaa tg ttcgacaccataacaacagc tac tcaaactgtaaaccaggataagaag ttgatg 15 60 

1561 ac ttcac actg tggacag tttttcaaaga tgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 1740 

1741 acctcaaacttttacgtaacatctcagggaaatgtggctctctccatcttgcatacaggg 1800 

1801 ctcccaatagaaatgaacacagagatattgcctgtgtgtttgcagagaagatggtttcta 1860 

1861 taaagagtaggaaagctgaaattatagtagagtctcctttaaatgcacattgtgtggatg 1920 

1921 gctctcaccatttcctaagagatacagtgtaaaaacgtgacagitaatactgattctagca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa 

PCKGCEA2: 

1 gggtggatcctaggctcatctccataggggagaacacacatacagcagagaccatggga 59 

MetGly 

60 cccctctcagcccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 119 
ProLeuSerAlaProPcoCysThrGlnHisIleThrTrpLysGlyLeuLeuLeuThrAla 

120 tcacttttaaacttctggaacctgcccaccactgcccaagtaataattgaagcccagcca 179 
SerLeuLeuAsnPheTrpAsnLeuProThrXhrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

240 ac tggctacatctggtacaaagggcaaatgacggacctctaccattacattacatcatat 29.9 
ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

300 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGlulrhrValTyrSer 

360 aa tgcatccctgctgatccagaatgtcacacagga-^gatgcaggatcctacaccttacac 419 
AsnAlaSerLeuLeuXleGlnAsnValThrGlnGluAspAlaGlySerTyrThcLeuHis 

4 20 atcataaagcgaggcgatgggactggaggagtaactggatatttcaetgtcaccttatac 479 
IlelleLysArgGlyAspGlyThrGl'yGlyValThrGlyTyrPheThrValThrLeuTyr 

480 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
SerGluThrProLysArgSerlleSerSerSerAsnLeuAsnProArgGluValnetGlu 
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54 0 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgc tgaat 599 
AlaValArgLeuIleCysAspProGluThrProAspAlaSerTyrLeuTcpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 659 
5 GlyGlnAsnLeuProMetThrHisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGluIleArgArgGlyVal 

7 20 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagc tgcccatgccttac 779 
10 SerAlaSerArgSerAspProValThrLeuAsnLeuLeuProLysLeuProNetProTyr 

780 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 839 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLeuAlaPheThrCysGlu 

840 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
75 ProLysSerArgAsnT^rThrTyrlleTtpTrpLeuAsnGlyGlnSerLeuProValSer 

9 00 ccgaggg taaa.gcgaccca ttgaaaacaggatactcattctacccagtgtcacgagaaat 9 59 
PcoArgValLysArgProIleGluAsnAcglleLeuIleLeuPcoServalThrArgAsn 

960 gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggca tccgcagtaaccca 1019 
2Q GluThrGlyProTyrGlnCysGluIleAcgAspArgTyrGlyGlylleArgSerAsnPro 

1020 gtcaccctgaatgtcctctatggtccagacctccccagaatttaccc.ttacttcacctat .1079 
ValThrLeuAsnValLeuTyrGIyProAspLeuProArglleTyrProTyrPheThrTyr 



25 



30 



35 



40 



45 



1080 tdccgttcaggagaaaacctcgacttgtccl:gctttgcggactctaacccaccggcagag 1139 
TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

114 0 tatttttggacaattaatgggaagtttcagctatcaggacaaaagctctttatcccccaa 1199 
TyrPheTrpThrlleAsnGlyLysPheGlnLeuSerGlyGlnLysLeuPhelleProGln 

1 ^0 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
IleThrThrAsnHisSerGlybeuTyrAlaCysSe rValAcgJ^snSe cAlaThcGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1319 
GluIleSerLysSernetlleValLysValSerGlyPcoCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 1379 
SerHisCnd 

1380 dddttcaagacaaagaagaaaaaggctcaatgttattggactaaataatcaaaagga taa 14 39 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgtttt'attctccagatt 14 99 

1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 
1560 tttgcttttgctctctatctgagtgccccccc 1591 



It will be appreciated that the instant specification and claims are set forth by way of illustration and not 
limitation and that various modifications and changes may be made without departing from the scope of the 
present invention. 



Claims 



A nucleic acid comprising a base sequence which codes for a peptide sequence, characterized in that 
the group nucleic acid is a DNA selected from the following group of five sequences: 
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10 30 50 

CAGCCCTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACACGGCCA 

5 

70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
HetGlyHisLeuSerAlaProLeuHisArgValAryValProTrpGln 

10 

130 150 170 

GCCCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCIC 
75 GlyLeuLcuLeuThcAlaSe r Leu LeuTh c PheT r pA sn? r o? c oTh rTh r A 1 ftG i nLe -j 

190 210 230 

20 ACTACTGAATCCATGCCATTC^^TGTTGCAGAGGGGA^'-xGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSe r.Me t ? roPheAsnVa 1 Al aGluGlyLysGluVal LeuLeuLeuValH i s 
250 270 290 

A.^TCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGC/'.AC 
25 AsnLeu? coGlnGlnLeuPheGlyTy r Se rTr pTy r LysGlyGluAr cjValAspGl yAsn 



310 330 350 

30 CGTCAA.^TTGTAGGATATGCAATAGG/\/kCTC;vAC/^J^-.GCTACCCCAGGGCCCGC;\uAACAGC 
ArcGlnl leValGlyTy r AlalleGlyThiGlnGlnAlaThrProGlyProAl aAsnSe r 



370 390 410 

35 ..... 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAAXGTCACCCAGAATGAC 
GlyArgGluThc 1 1 cTy r PcoAsnAlaSe r LeuLeuI IcGlnAsnValThrGlnA&nAsp 



40, OO 450 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATG^AGAAGCAACTGGA 
ThrGlyPheTy cThcLcuGlnVall IcLysSerAspLeuValAsnGluGluAldThrGly 

45 
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<90 510 530 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAAC^ACTCCAACCCT 
s GlnPheHisValTycPcoGluLeuProLysProSerlleSerSe cAsnAsnSe rAsnPro 



550 5-jp. 590 

10 GTGGAGGACA-^GGATGCTGTGCCCTTCACCTGTGAACCTGAGACTCAGCACACAACCTAC 
ValGluAspLysAspAlaValAlaPheThrCysGluProGluThrGlnAspThrThrTyr 



75 



610 630 650 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCC.*wATGGC 
LeuTrpTcpIleAsnAsnGlnSerLeuProValSerProArgLeuGlnLeuSerAsiiGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTCTCACAAGGAATCACACAGCACCCTATCAGTCTGAA 
AsnArgThcLeuThcLeuLeuSecValThrArgAsnAspThtClyPcoTycGluCysClu 



25 750 770 

ATACAGAACCCAGTGAGTGCGAACCCCAGTGACCCAGTCACCTTCA.\TGTCACCTATGGC 
IleGlnAsnProValSerAlaAsnArgSetAspProValThrLeuAsnValThcTyrGly 

30 

79 0 610 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 

ProAspThrProThr IleSe cPtoSc r AspThiTy rTy r AcgP r oGlyAl aAsnLeuSe r 

35 

850 870 S90 

CTCTCCTGCTATGCAGCCTCTAwACCCACCTGCACAGTACTCCTGGCTTATCAATCGAACA 
LeuSerCysTyrAlaAlaSe rAsnP r oProAlaGl nTy rSe cTi pLeul 1 eAs nGl yTh r 



910 930 950 

« TTCCAGCA-^GCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAJ^TAGTGGATCC 
PheGlnGlnSerThrGlnGluLeuPhelleProAsnlleThrValAsnAsnSerGlySer 



so 



970 990 1010 

TATACCTCCCACCCCAATAACTCAGTCACTGGCTGCAACAGGACCACACTCAACACGATC 
TytThtCysHlsAlaAsnA&nSe rVa iThcGl yCy sAsnAr gTh t Th r va 1 Ly sTh r lie 
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1030 1050 1070 

ATAGTCACTGATAATGCTCTACCACAAGAAAATGGCCTCTCACCTCGGCCCATTGCTGGC 
IleValThrAspAsnAlaLeuProGlnGluAsnGlyLeuSerProGlyAlalleAlaGly 

1090 1110 1130 

ATTCTGATTGGAGTAGTGGCCCTCGTTGCTCTGATACCAGTAGCCCTGGCATGTTT7CTC 
1 leval IleGlyValvalAlaLeuValAlaLeuIleAlaVdlAlaLeuAlaCysPheLeu 

1150 1170 1190 

CATTTCGGGAAGACCGGCAGGGCA-^GCGACCAGCGTGATCTCACAGAGCACAAACCCTCA 
HisPheGlyLysThrGlyArgAlaSerAspGlnArgAspLeuThrGluHisLysProSer 

1210 1230 1250 

GTCTCCAACCACACTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACT 
ValSe cAsnHisThcGlnAspHisSe lAsnAspPcoProAsnLysMe tAsnGluvalThr 

1270 1290 1310 

TATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCATCC 
Ty rSe rThcLeuAsnPheGluAlaGlnGlnProThcGlnProThrSc rAlaSe rProSe r 

1330 1350 1370 

CTAACAGCCACAGAAATAATTTATTCAGAJ\GTAAAAAAGCAGTAATGAAACCTGTCCTGC 
LeuThrAlaThcGluI lelleTyrSerGluValLysLysGln 

1390 1410 1430 

TCACTGCAGTGCTGATGTATTTCA;\GTCTCTCACCCTCATCACTAGGAGATTCCTTTCCC 

1450 1470 1490 

CTGTAGGGTAGAGGGGTGGGGACAGAAACAACTTTCTCCTACTCTTCCTTCCTAATAGGC 

1510 1530 1550 

ATCTCCAGGCTGCCTGGTCACTGCCCCTCTCTCAGTGTCAATAGATGAAAGTACATTGGG 

1570 1590 1610 

AGTCTGTAGGAAACCCAACCTTCTTGTCATTGAAATTTGGCAAAGCTGACTTTGGGAAAG 
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1630 1650 1670 

• • • . • 

ACGGACCAGAACT7CCCCTCCCTTCCCCTTTTCCCAACCTGCACTTGTTTTAAACTTGCC 

5 

IfiaO 17.10. . .....1730.. 

TGTTCAGACCACTCATTCCTTCCCACCCCCAGTCCTGTCCTATCACTCTAATTCGGATTT 

70 

I'^SO 1770 1790 

GCCATAGCCTTGAGGTTATGTCCTTTTCCATTAAGTACATGTGCCAGGAAACAGCGAGAG 

1810 1830 1850 

AGAGAAAGTAAACGGCAGTAATGCTTCTCCTATTTCTCCAAAGCCTTGTGTGAACTAGCA 

20 

1870 1890 1910 

* • • • . 

AAGAGAACAAAATCAAATATATAACCAiATAGTGAAATGCCACAGGTTTGTCCACTGTCAG 

25 

1930 1950 1970 

GGTTGTCTACCTGTAGGATCAGGGTCTAAGCACCTTGGTGCTTAGCTAGAATACCACCTA 

30 

1990 2010 2030 

ATCCTTCTGGC.ftJ^GCCTGTCTTCAGAGAACCCACTAGAAGCAACTAGGAAAA-ATCACTTG 

35 

2050 2070 2090 

CCAAAATCCAAGGCAJ\TTCCTGATGGAAAATGC/'JV/vAGCACATATATGTT'I^AATATCT 

40 

2110 2130 2150 

TATGGGCTCTGTTCAAGGCAGTGCTGAGAGCGAGGGGTTATAGCTTCAGGAGGGAACCAG 

45 

2170 2190 2210 

CTTCTGATAAAqACAATCTGCTAGGAACTTGGGAAAGGAATCAGAGAGCTGCCCTTCAGC 

50 
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2230 2250 2270 

. • • • • • 

CATTATTTAAATTGTTAAACAATACACAATTTCCCGTATTCGGATTTTTCTCCTTTTCTC 
2290 2310 2330 

• • • « • ■ 

TCACACA.TTGCACCA-TTTTAATTTTTCTAACTCC.TTAT.TTAT.CTCAAAA&CCT.XA.TXXTT.... 

2350 2370 2390 

ACTTAGCTTAGCTATGTCACCCAATCCCATTGCCTTACGTGAAAGAAACCACCGAAATCC 

2410 - 2430 • 2450 

CTCAGGTCCCTTCGTCAGGAGCCTCTCAAGATTTTTTTTCTCAGAGGCTCCAAATAGAAA 

2470 2490 2510 

« ■ ■ • • • • 

ATAAGAAAAGGTTTTCTTCATTCATCGCTAGAGCTACATTT/iACTCAGTTTCTAGGCACC 

2530 2550 2570 

TCAGACCAATCATCAACTACCATTCTATTCCATGTTTGCACCTGTGCATTTTCTGTTTGC 

2590 2610 2630 

CCCCATTCACTTTGTCAGGAAACCTTCGCCTCTGCTAAGGTGTATTTGGTCCTTGACAAG 

2650 2670 2690 

TGGGAGCACCCTACAGGGACACTATCACTCATGCTGGTGGCATTGTTTACAGCTAGit^-t^G 

2710 2730 2750 

CTGCACTGGTGCTAATGCCCCTTGGGA;\ATGGGGCTGTGAGGAGGAGGATTAT/^J\CTTAG 

2770 2790 2010 

GCCTAGCCTCTTTTAACAGCCTCTGAAATTTATCTTTTCTTCTATGGGGTCTATAAATGT 

2830 2050 2870 

V .... • 

ATCTTATAATAAAAAGGAAGGACAGGAGGAAGACAGGCAAATGTACTTCTCACCCAGTCT 



CO 
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2890 2910 2930 

TCTACACAGATGGAATCTCTTTCGGCCTAACAGKAAGGTTTTATTCTATATTGCTTACCT 

5 

2950 2970 2990 

• • • • • • 
GATCTCATGTTAGGCCTAAGAGGCTTTCTCCAGGAGGATTAGCTTGGAGTTCTCTATACT 

10 

3010 3030 3050 

• • • ■ • > 
CAGGTACCTCTTTCAGGGTTTTCTAACCCTGACACGGACTGTGCATACTTTCCCTCATCC 

3070 3090 3110 

ATGCTGTGCTGTGTTATTTAATTTTTCCTGGCTAAGATCATGTCTGA/^TTATGTATGAAA 

20 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCAGACATCGAAA/vA.^AAAA , 

25 
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(2) 

5 10 30 50 

CAGCCGTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 



70' ■ 



70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
Me tGlyHi sLeuSerAlaProLeuHisArgValArgValProTrpGln 

130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuThrAlaSe r LeuLeuThrPheTrpAsnProProThrThrAlaGlnLeu 

20 

190 210 230 

• '• • • • 
ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSerWetProPheAsnValAlaGluGlyLysGluValLeuLcuLcuValHis 

25 

250 270 290 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
30 AsnLeuProGlnGlnLeuPheGlyTy rSe cTrpTy rLysGlyGluArgValAspGlyAsn 

310 330 350 

* • • • • • 

36 CGTCAAATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArgGlnl leValGlyTyrAlalleGlyThrGlnGlnAlaThrProGlyProAlaAsnSer 

370 390 410 

40 ' ' 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAGAATGAC 
GlyArgGluThrlleTyrProAsnAlaSerLeuLeuIleGlnAsnValThrGlnAsnAsp 
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430 450 470 

■ACAG■CATrCTACACCtTKCK^tJTCATAAAGTCAGAtCTTCTGAATG^^GA»^GCJ>ACTGG^^ 
ThrGlyPheTycThrLeuGlnVallleLysSerAspLeuValAsnGluGluAlaThrGly 



490 510 530 

,«•••• 

CAGTTCCATGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHisValTyrPro'GluLeuProLysProSerlleSerSerAsnAsnSerAsnPro 



550 570 590 

• ••••• 

75 GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPh.eThrCysGluProGluThrGlnAspThrThrTy r 



610 630 650 

CTGTGGTGGATAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpI leAsnAsnGlnSe rLeuProVal Se rProArgLeuGlnLeuSe rAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGACACAGGACCCTATGAGTGTGAA 
AsnArgThr LeuThrLeuLeuSe cvalThrArgAsnAspThrGlyProTy rGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IleGlnAsnProValSe rAlaAsnArgScrAspProValThrLeuAsnValThrTyrGly 



790 810 830 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
ProAspThr ProThr I leSe rProSe rAspThrTy cTy cArgProGlyAlaAsnLeuSe r 

40 
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850 670 690 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATCAATGGAACA 
LeuT^ rXryrry rAl ffAl ff SerATTTF-riy^t^Al aOl nTy r^^^ 



910 930 950 

• ••••• 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PheGlnGlnSccThrGlnGluLeuPhcIleProAsnlleThrValAsnAsnSerGlySer 



970 990 1010 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
TyrThrCysHisAlaAsnAsnSe rValThrGlyCysAsnArgThrThrValLysThr lie 



1030 1050 1070 

• ••>•• 
ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAATCAAAGCCAGCAAGACCACA 
IleValThrGluLeuSerProValValAlaLysProGlnlleLysAlaSerLysThrThr 



1090 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAACCTGACCTGCTCCACAAATGACACTGGAATCTCC 
ValThrGlyAspLysAspSe rValAsnLcuThrCysSe rThrAsnAspThrGlylleSe r 



1150 1170 1190 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
IleArgTrpPhePheLysAsnGlnSerLeuProSerSerGluArgMetLysLeuSerGln 



1210 1230 1250 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSe r IleAsnProValLysArgGluAspAlaGlyThrTyrTrpCys 
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1270 1290 1310 

• « • • • • 

GAGGTCTTCAACCCAATCAGTAApMCCAAAGCGACCCCATCATGCTGAACGTAAACTAT 
^ GluValPhcAsnProl leSe rLysAsnGlnSe rAspProI leMe tLeuAsnvalAsnTyr 

1330 1350 1370 

AATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
AsnAlaLeuProGlnGlUAsnGlyLeuSerProGlyAlalleAlaGlylleValllcGly 

1390 1410 1430 

^5 GTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTGCATTTCGGGAAG 
ValvalAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

20 

ACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGAAGTTACTTATTC 
ThrGlySc rSe rGlyProLeuGln 

25 1510 1530 1550 

taccctgaactttg;^j»^gcccagcaacccacacaaccaacttcagcctccccatccctaac 

30 1570 1590 1610 

agccacagaaataatttattcagaagtaaaaaagcagtaatgaaacctgaaaaaaaaaaa 

35 1630 

aaaaaaaaaa 
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(3) 



10 30 50 

• • • • 

CACCCCTGCTCGAAGCGTTCCTGGAGCCCAAGCTCTCCTCCACAGCTGKACACAGGGCCA 

70 

70 90 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 



75 



Mc tGlyHi sLeuSe rAldPcoLeuHi sArgValAcgval Pr oTr pGl n 



130 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCACCTC 
GlyLeuLeuLcuThrAlaSe cLeuLeuThrPheTrpAsnProPcoThcThrAlaGlnLeu 



190 210 230 

ACTACTGAATCCATGCCATTCAATGTTGCAGAGGGGAAGGAGGTTCTTCTCCTTGTCCAC 
ThrThrGluSe rMe tProPhcAsnValAlaGluGlyLysGluValLeuLcuLeuValHi s 



250 270 290 

30 ...... 

AATCTGCCCCAGCAACTTTTTGGCTACAGCTGGTACAAAGGGGAAAGAGTGGATGGCAAC 
AsnLeuProGlnGlnLeuPheGlyTy cSerTrpTycLysGlyGluArgValAspGlyAsn 



310 330 350 

CGTC;^JLATTGTAGGATATGCAATAGGAACTCAACAAGCTACCCCAGGGCCCGC;•-.^-^^CAGC 
ArgGlnlleValGlyTy rAlalleGlyThcGlnGlnAlaThrProGlyProAlaAsnSe r 



370 390 410 

GGTCGAGAGACAATATACCCCAATGCATCCCTGCTGATCCAGAACGTCACCCAG/vA.TGAC 
GlyArgGluThr IleTy rProAsnAlaSe cLeuLeulleGlnAsnValThcGlnAsnAsp 

45 

430 450 470 

• • • • • 

ACAGGATTCTACACCCTACAA.GTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 

ThrGlyPheTy rThr teuGlnVdl I leLysSerAspLeuValAsnGluGluAlaThrGly 
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490 510 530 

.C A.GTTC CAIC JATAC AQ CTGC CCAAGp CCTCCATC TC C AG C AAC AAC TC C AAC C C T 
GlnPheHisValTyrPcoCluLeuPro Ly"s"P r oiSerlleSe'rS e"r A ink's nS~e r A's nTHT d 



550 570 590 

• , . • • • 

GTGGAGGACAAGGATGCTGTGGCCTTCACCTGTCAACCTGAGACTCAGGACACAACCTAC 
ValGluAspLysAspAlaValAlaPhcThrCysGluProGluThrGlnAspThrThrTyr 



610 630 650 

CTGTGGTCGATAAACAATCAGAGCCTCCCCGTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpi leAs HAS nGlnSe r Leu Pr oval Sc rProAr gLeuGln Leu Se cAsnGly 



670 690 710 

AACAGGACCCTCACTCTACTCACTGTCACAAGCAATGACACAGGACCCTATGAGTGTGAJ^ 

AsnArgThrLeuThrLeuLeuSerValThrArgAsnAspThrGlyProTyrGluCysGlu 



730 750 770 

ATACAGAACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
IlcGlnAsnProValSe cAlaAsnArgScrAspProValThrLcuAsnValThrTy rGly 



790 BIO 830 

CCGGACACCCCCACCATTTCCeCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
? r 0A-5pTh r PTOThtl l-r5ier-P^'o5«-A'9i>T4vc-Tyr-Ty-cAc^^-coCJ yAl a As nLe xxS£^ 



850 870 890 

CTCTCCTGCTATGCAGCCTCTAACCCACCTGCACAGTACTCCTGGCTTATC^ATGGAACA 
LeuSe rCysTy rAlaAlaSe rAsnProProAlaGlnTy rSe rTrpLeuI leAsnGlyThr 



910 930 950 

• • • • . ■ • • 

TTCCAGCAAAGCACACAAGAGCTCTTTATCCCTAACATCACTGTGAATAATAGTGGATCC 
PhcClnGlnSe cThrGlnGluLeuPhcIlcProAsnX IcThrValAsnAsnSe rGlySc c 



970 990 1010 



TATACCTCCCACCCCAATAACTCAGTCACTCGCTGCAACACGACCACAGTCAAGACGATC 
TycThrCysHlsAlaAsnAsnSc rValThcGlyCy sAsnArgTh cThrValLy sThr lie 
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1030 1050 lOTO 

ATAGTCACTGATAATGCTCTACCACAAGAAAATGGCCTCTCACCTGGGGCCATTGCTGGC 
I leValThrAspAsnAlaLeuProGlnGluAsnGlyLcuSe rProGlyAlal leAlaGly 

1090 1110 1130 

ATTGTGATTGGAGTAGTGGCCCTGGTTGCTCTGATAGCAGTAGCCCTGGCATGTTTTCTG 
I leVall leGlyValValAlaLcuValAldLeuIleAldValAlaLeuAlaCysPheLeu 

1150 1170 1190 

... ... 

75 CATTTCGGCAAGACCGGCAGCTCAGGACCACTCCAATGACCCACCTAACAAGATGAATGA 
Hi sPheGlyLysTbcGlySerSerGlyProLeuGln 

1210 1230 1250 

20 ^ ^ . . 

AGTTACTTATTCTACCCTGAACTTTGAAGCCCAGCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

CCCATCCCTAACAGCCACAGAAATAATTTATTCAGAAGTAAAAAAGCAGTAATGAAACCT 



1330 

30 

GAAAAAAAAAAAAAAAAAA 
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(4) 



10 



15 



20 



1 acagcacagctgacagccgtactcaggaagcttctggatcctaggcttatctccacagag 60 

61 gagaacacacaagcagcagagaccatggggcccctctcagcccctccctgcacacacctc 120 

MctGlyProLeuSerAlaProProCysThcHisLeu 

1-21 atca-cttg9a-a.9^gggtc<:-trg<c-tc«^cAtcacttttaa-ac.ttctgg^a.tcc9cccaca... 1.80 - 
I leThrTrpLysGlyValLeuLeuThrAlaSerLeuLeuAsnPheTrpAsnProProThr 

181 actgcccaagtcacgattgaagcccagccacccaaagtttctgaggggaagga tgttctt 240 
ThrAlaGlnValThrlleGluAlaGlnProProLysValSerGluGlyLysAspValLeu 

241 ctacttgtccacaat.ttgccccagaatcttgctggctacatttggtacaaagggcaaatg 300 
LeuLeuValHisAsnLeuProGlnAsnLeuAlaGlyTyrlleTrpTyrLysGlyGlnMet 

301 acatacgtctaccattacattacatcatatgtagtagacgg.tcaaagaattatatatggg 360 
ThrTyrValTyrHisTyrlleThrSerTyrValValAspGlyGlnArgllelletyrGly 

361 cctgcatacagtggeagagaaagagtatattccaatgcatccctgctgatccagaatgtc 4 20 
ProAlaTyrSerGlyArgGluArgValTyrScrAsnAlaSerLcuLeuIleGlnAsriVal 

.4 21. acgcaggaggatgcaggatcc tacacc ttacacatca taaagcgacgcga tgggac tgga 480 
ThrGlnGluAspAlaGlySerTyrThrLeuHisIlelleLysArgArgAspGlyThrGry 

4 81 ggagtaactggacatttcaccttcacct tacacc tggagactcccaagccctccatc tec 54 0 
GlyValThrGlyKisPheThrPheThrLeuHisLeuGluThrProLysProSerlleSer 

541 agcagcaacttaaatcccagggaggccatggaggctgtgatcttaacctgtga tcctqcg 600 
SerSerAsnLeuAsnProArgGluAlaWetGluAlaVallleLeuThrCysAspProAla 

601 actccagccgcaagctaccagtgg tgga tgaatggtcagagcctccctatgac tea cagg 660 
ThrProAlaAlaSerTyrGlnTrpTrpMetAsnGlyGlnSerLeuProMetThrHisArg 

661 ttgeagetgtecaaaaceaaeaggaccctetttatatttggtgtcacaaagtata ttcca 720 
LeuGlnLeuSerLysThrAsnArgThrLeuPhellcPheGlyValThrLysTyrlleAla 

"7'll ggaccctatgaatgtgaeatacggaacccagtgagtgccagccgcagtgacccagtcacc 780 
GlyProTyrGluCysGluIleArgAsnProValSerAlaSerArgSerAspProValThr 

781 etgaatctcctcceaaagctgtceaagccetaeatcacaatcaacaacttaaaccccaga 84 0 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnProArg 

841 gagaataaggatgtettaacctteaectgtgaacctaagagt gaga a c tacacc tacatt 900 
GluAsnLysAspValLeuThcPheThcCysGluProLysSerGluAsnTyrThrTyrlle 

901 tggtggctaaatggtcagagectccctgtcagtcccagggtaaagcgaccca ttgaaaac 960 
TrpTrpLeuAsnGlyGlnSerLeuProValSe rProArgValLysArgProI leGluAsn 

961 aggatcctcattctacccaatgtcacgagaaatgaaacaggaccttatcaatgtgaaata 1020 
Argi leLeulleLeuProAsnValThrArgAsnGluThrGlyProTyrGlnCysGluI le 

1021 cgggaccgata tggtggcatccgcagtgacccagtcaccctgaatgtcctctatggtcca 1080 
ArgAspArgTyrGlyGlylleArgSerAspProValThrLeuAsnValLeuTyrGlyPro 
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1081 9^cctccccagcatttacccttcattcacctattaccgttca99d9d^ddcctctacttt 1X40 
AspLeuProSer IleTyrProSerPheThrTyrTyrArgSerGlyGluAsnLeuTyrPhe 

1141 tcctgcttcggtgagtctaacccacgggcacadtattcttggacaattddtgggaagttt 1200 

5 SerCysPheGlyGluSerAsnProArgAlaGlnTyrSerTrpThrlleAsnGlyLysPhe 

- 1201 -cagctatcaggacaaaagctctctdtcccccaddtdactacaaagcatdgtgggctctat 1260 
GlnLeuSecGlyGlnLysLeuSerlleProGlnlleThcThrLysHisSerGlyLeuTyr 

1261 gcttgctctgttcgtaactcagccactggcaaggaaagctccaaatccatcacagtcaaa 1320 

70 AlaCysSerValArgAsriSerAlaThrGlyLysGluSerSerLysSerlleThrValLys 

1321 gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccatg 1360 
ValSerAspTrpIleLeuProEnd 

75 ,Jp81 gaatcacgaagagcaagacccactctgttccagaagccctataatctggagg tggacaac 1*4 4 0 

1^41 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

1501 aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

1561 acttcacactgtggacagtttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 1740 

20 1741 acctcaaacttttacg taacatctcagggaaatgtggctctctccatcttgcatacaggg 1800 

1801 ctcccaatagaaatgaacacagagatattgcctgtgtgtttgcagagaagatggtttcta 18 60 

1861 taaagagtaggaaagctgaaattatagtagagtctcctttaaatgcacattgtgtggatg 192 0 

1921 gctctcaccatttcctaagagatacag tgtaaaeiacgtgacagtaatactgattctagca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa 2010 
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(5) 



1 g99tggatcctaggctcatctccatag9g9agaacacacatacagcaga9accat9gga 59 
5 MetGly 

60 cccctctcagcccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 119 
ProLeuSerAlaProProCysThrGlnHisIleThrTrpLysGlyLeuLeuLeuThrAla 

120 tcacttttaaacttctggaacctgcccaccactgcccaagtaataattgaagcccagcca 179 
70 SerLcuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSerGluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

240 actggctacatctggtacaaagggcaaatgacggacctctaccattacattacatcatat 299 
75 ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

0:300 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGluThrValTyrSer 

360 aatgcatccctgctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 
20 AsnAlaSerLeuLeulleGlnAsnValThrGlnGluAspAlaGlySerTyrThrLeuHis 

420 atcataaagcgaggcgatgggactggaggagtaactggatatttcactgtcaccttatac 479 
IlelleLysArgGlyAspGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

480 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
25 SerGluThrProLysArgSerlleSerSerSerAsnLeuAsnProArgGluValMetGlu 

540 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 599 
AlaValArgLeuIleCysAspProGluThrProAspAlaSerTyrLeuTrpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 659 
30 GlyGlnAsnLeuProMetThrHisArgLeuGlnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
S LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGluIleArgArgGlyVal 

720 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 
35 SerAlaSerArgSerAspProValThrLeuAsnLeuLeuProLysLeuProMetProTyr 

780 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 839 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLeuAlaPheThrCysGlu 

84 0 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
40 ProI-ysSerArgAsnTyrThrTyrlleTrpTrpLeuAsnGlyGlnSerLeuProValSer 

900 ccgagggtaaagcgacccattgaaaacaggatactcattctacccagtgtcacgagaaat 959 
ProArgValLysArgProlleGluAsnArglleLeuIleLeuProSerValThrArgAsn 

960 gaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcatccgcagtaaccca 1019 
45 GluThrGlyProTyrGlnCysGlulleArgAspArgTyrGlyGlylleArgSerAsnPro 
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1020 gtcaccctgaatgtcctctatggtccagacctccccagaatttacccttacttcacctat 1079 
ValThcLeuAsnValLeuTyrGlyPrbAspLeuProArglleTyrPcoTyrPheThrTyr 

1080 taccgttcaggagaaaacctcgacttgtcctgctttgcggactctaacccaccggcagag 1139 
^ TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

1140 tatttttggacaattaatgggaagtttcagctatcaggacaaaagctctttatcccccaa 1199 
TyrPheTrpThcIleAsnGlyLysPheGlnLeuSerGlyGlnLysLeuPhelleProGIn 



10 



1200 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
IleThrThrAsnHisSerGlyLeuTyrAlaCysSerValArgAsnSerAlaThrGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1319 
GluIleSerLysSernetlleValLysValSerGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 1379 

75 SerHisEnd 

1380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataetcaaaaggataa 14 39 

1440 tgttttcataatttttattggaaaatgtgctgattcttggaatgtttt'attctccagatt 1499 

1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 

1560 tttgcttttgctctctatctgagtgccccccc 1591 

20 



2. A replicable recombinant cloning vehicle having an insert comprising a nucleic acid of claim 1 . 
25 3. A cell that is transfected, infected or injected with a recombinant cloning vehicle of claim 2. 

4. A method for preparing a polypeptide, said method comprising the steps of 

(a) culturing the cell of claim 3 

(b) recovering the polypeptide expressed by said cell. 



30 



35 



5. A method for preparing an antibody directed against a polypeptide said method comprising the steps of 

(a) preparing said polypeptide by the method of claim 4 

(b) injecting said polypeptide into a host capable of producing antibodies and 

(c) recovering said antibodies. 

Patentanspriiche 



1. Nucleinsaure. umfassend eine Basen-Sequenz, die fur eine Peptid-Sequenz codiert, 

dadurch gekennzeichnet. daB die Gruppen-Nucleinsaure eine DNA ist, die aus der folgenden Gruppe 
40 von funf Sequenzen ausgewahit ist: 
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10 30 50 

CAGCCGTCCTCGAACCCTTCCTGGACCCCAAGCTCTCCTCCACAGGTGPJvGACACCCCC.n 

5 

70 50 no 

GCAGCAGAC.-.CCATGGGGCACCTC7CAGCCCCAC7TCACAGAGTGCCTGTACCCTGGCAG 
r;e iGlyHi sLeuSe rAldPr oLeuHi sAr gVelAryvalProTrpGln 

70 

1-0 150 170 

GGGCT7CTGCTCACAGCCTCACTrCTA.\CCTVCTGGAACCCGCCCACCACTGCC:CAGC-I C 
75 Glyle v"-e v-e vThrAl tSe rLe uLeuThr PheTr pAsn? r c? rcThrThr Al jiGl i-.L2v 

HO 210 130 

AC7AC70>J^7CCATGCCAT7C.-.^TGTTGCAvAGC•GG^^-X•GAGGT7C77CTCC*rrC•:CC 
7h:7h:GluSe rriS t?ro?heAsr/%;el AlcGluGlyLysGluValLcuLcvLeuVElH 

2rO 270 290 

A.-,7C7GCCCCAGCAAC777T7GGCTACAGC7GG7AC^«'.i.GGGGA.-.AG;X-7GGA7GGCAAC 
AsnL?u?:cGlriGlnLei3r^eGlyTy r£.€r7r c7yr LysGlyG • uAr 9Vr.l AipGl^y 



350 350 



CG7CA.iv^77G7AGGA7A7GCA.-,TAGGA.^CTC;^^-X.i^--XC7ACCCCAGGGCCCGC;>.-^^^^ 
ArcGinlleValGlyTyr Al ft 2 i cGl vTh: Gl nGlnAl aThr?:cCly?roAl aAsnSe r 

370 350 CIO 

35 GG7CGACACAC>J^TATACCCCAATGCATCCCTCCTCATCCAGAACCTCACCCACA.ATGAC 
CI yAf gCluThr 1 leTyrProAsnAleSerLeuLtull cClnAsnVftlThrClnAtnAsp 

<i30 «S0 <10 

40 ACAGGATTCTACACCCTACAAGTCA7/*LAACTCAGATCTTGTG;^7GAjkGAJVGCAACTGGA 

ThrClyPheTy cThrLeuClnVftl 1 leUysSe r AspLeuva 1 AsnCl uCl uAl dTh rCi y 
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ccc SIO S30 

CAGT7CCArGr.=vTACCCGCAGCTCCCCK/\GCCCTCCATCTCCAGC/V\C;^ACTCCA.J^CCCT 
CI nPheHi sv* iTy : ? r oGluLeu? roLy s ? roSe r 1 1 c Sc r 5c r AsnAsnSc rAsnPro 



SS: S70 ^90 

GTGGAGGAC.-.-.GCA7GCTGTGGCCTTCACCTGTGAACCTC^.GACTCAGGACAC;i^^CCTAC 
10 valGluASy-ysAspAlevalAlaPheThrCysGluProGluThrGlnAspThrThrTyr 



6iC 630 650 

75 CTGTGGTGGATA..V^CA.s,TCACAGCCTCCCGCTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuicpTrpI l-As.-AsnGlnScrLeuProVdlScrProArgLeuClnLcuScrAsnCly 



s:: 690 *710 

AACAGGACCCTCACTCTACTCAGTCTCACAAGGAATGACACAGCACCCTATGAGTGTGAA 
AsnArcTh r •;.ecThr LeuLcuSc r vp. iTh r Ac gAtnAs pTh rCl y ? r oTy rGluCy&Glu 



li: 750 770 

ATACAG.-JkCCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTCXMXTCACCTATCGC 
! 1 eGlr.Asr.PrcVilScrAlaAsnAccSecAspProVaiThr LsuAsnVdlThrTyrCly 

- c ■ 810 B 3 0 

30 " 

CCGGACACC::CACCATT7CCCCT7CAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
?rcAsoTh:?roThr IleSerProScrAspThrTyrTyrArgProGlyAldAsnLcuSer 



35 Qjr. 870 SSO 

• • • • * 
CTCTCCTGCTATGCAGCCTCT>-^CCCACCTGCACAGTACTCCTGGC7TATC.A.:vrGG.^^^CA 

LeuSc rCysTyrAiaAldSc rAsn?ro?roAleGln7yrSerTi pLcuI leAsnGlyThr 

40 

91:. 930 550 

• • ■ . * 

TTCCACC>----.GCACAC.V^GACC7C7TTATCCCTA.^CATCAC7GTGA.^TP^7AGTGGATCC 

PheGlnGlnSe r7hrGlnGluLcu?hcllcProAsnl ieThrVdlAsr.AsnSe rClySe r 

45 

970 990 1010 

TATACCTCCCACCCCAATi^CTCACTCACT'JGCTGCAACACCACCACACTCAACACGATC 
TytThrCysHitAlftAsnAsnSecValThrCl yCy sAsnAr 9TU c 7hr Vd 1 Ly sTh r I 1 e 
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1030 lOSO 1070 

• . • . , 

ATACTCACTGATAATCCTCTACCACAACXAAA7CCCCTCTCACCTCCGCCCATTCCTCCC 
I i eValThrAspAsnAldLeu?roClnGluAsnCl yLeuSer?roClyAl£!leAlaGly 

5 

IOSj 1110 1130 



70 



ATTCTGA7TCCACTAGTGGCCCTGGT7GCTC7CATACCAGTAGCCC7GGCATGTTTTCTG 

1 level 1 1 €GlyVelVdlAleLeuValAldLeuncAlc;VfclAlftLeuAlsCvs?heLeu 



115: ll'^O 1190 

CA777CGGG.-.^GACCGGCAGGCCA.\GCGACCAGCG7GArC7CACACAGCACAAACCC7CA 
15 Hi sPheGlyLysThrGlyArgAlaSecAspGlnArgAspLeuThrCluHi sLysProSer 

1210 1230 1250 

20 G7C7CCAAeCACACTCACCACCAC7CCAATCACCCACCTAACAAGA7CAA7GAAGT7AC7 
val Se rAsnHi s7hcGlnAspllisSei:AsnAsp?roProA$nLysns tAsnGluValThr 



127 0 1290 1310 

2^ rA77C rACCC7CAAC777GAAGCCCACCAACCCACACAACCAACT7CACCC7CCCCATCC 
7y rSe r7*n : LeuAsnPheGl oAi eGlnCl n? r oThrCl n? ro7h c Se rAleSerProSer 

133D 13S0 13-0 

30 ...... 

C7AACAGCCACACAAA7AA777A77CAG>J^G7f>AAJU^GCAGTAA7GAA.SiCCrG7CCTGC 
Leu7hrAleThrGluI lclie7yrSerGii:VelLysLysGl-n 

35 i5SC KIO K30 

• • • • . , 

7CAC7GCAG7GC7GATCTAT7TCA/\GTC7C7CACCC7CATCACTAGGACA77CC777CCC 

^ 1450 1470 1490 

C7G7AGGG7AGAGGGG7GGCCACACAAACAAC777C7CC7AC7C77CC77CC7AA7ACGC 

1510 1530 15S0 

45 

ATC7CCAGGCTGCC7GGTCACTGCCCCTCTC7CAGTG7Ci>J^ rAGA7GAAAGTACATTGGG 
I 

I5">b 1590 1610 

50 ' ' * " " ' " 

AG7CTC7AGGAAACCCAACCT7C77G7CA7TCAuAA7TTCGCAAAGC rGAC77rGGGAAAG 
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1S3C 16 50 16 «0 

ACGGACCACA.-.C77CCCC7CCCTTCCCCTTTTCCCAACCTCCAC7rCTTTTA.'v.-.CTTGCC 

S 

16S0 . 1710 1730 

7C7TCAGACCA.C7CA77CC7TCCCACCCCCACTCCTCTCCTATCAC7Cr.iJ^TTCGCAT77 

w 

nSO 1770 1790 

GCCATAGCC?TGACGTTA7CTCCTTTTCCATTAJ^CTACA7CTCCCAGG/%AACAGCC;^.CAC 

75 

1810 1830 1850 

* * • * • . 
AGAGAAAGTA.f-2.CGGCAGT;^JVTGCTTCTCCTATTTCTCC;kJVAGCCTTCTGTGAACTAGCA 

20 

1 6"/0 1890 - 19ia 

• • ■ • . , 
AACAGAACAAAA7C.*^V^TATA7AACCAATACTCAAA7GCCACAGC7T7GTCCACTG7CAG 

1930 1950 1970 

GG77G7C7ACC7GTAGGA7CAGGG7C7A-\GCACC77GC7GC77AGC7AGAA7ACCACC7A 

^° 19S0 2010 2030 

A7CC77C7GGC.--i.GCC7G7C77CAGAGAACCCAC7AG/vAGC;iJ^C7AGGAAAA-a,7CAC77G 

35 2050 2070 2090 

CCAAAA7CC^-a.GGC;'Ji.7TCC7GA7GG/v>AATGC/xAAACCACA7A7A7G77-rTAA7A7CT7 

40 2110 2130 2150 

7A7GGGC7C7G77CAAGGCACTGCTGAGAGGGAGGGGT7ATAGCT7CAGGAGGG/ui.CCAG 

45 2 1 70 2190 221 0 

C77C7GA7AJ^qAC^-k7CTGCTAGGAACTTGCGA*AAGGAATCAGAGACC7GCCCTTCAGC 



50 
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2230 2230 2270 

CAT7ATTTA.VAT7GTT.\>J<GAwATACACKATTTGGCGTATTGCGAT7TTTCTCCT77TCTC 

5 

2 290 2 310 2 3 30 

TGACACA7TCCACCA7T7T.--:iTTTTTC7A-^XTGCTTATTTATCTCA.^^J5.GGG77AT7TTT 

10 

235: 2370 2390 

AC77AGC7TACC7A7G7CAGCC^^7CCCATTCCC7TAGGTCAAA»GAAACCACCGAAA7CC 

75 

2<10 2430 * 2450 

CTCAGGTCCCT7CGTCAGGAGCCTCTCAAGATTT7TTTTGTCAGAGCCTCCAAATAGAAA 

20 

2O0 2490 2510 

A7AAGA.AAACCT7T7C7TCAT7CATCCC7AGAGCTACATT7AJVCTCACTT7CTAGGCACC 

2550 2550 2570 

7CAGACCAA7CA7CAAC7ACCATTCTATTCCATGTT7GCACCTGTGCATT7TCTGTTTGC 

30 2590 2610 2630 

CCCCA7TCAC777G7CAGGA^CCTTGGCCTCTGCTAAGGTGTATTTGG7CCTTGAGAAG 



35 



40 



45 



50 



2650 2670 259C 

7GGGAGCACCC7ACAGCGACACTATCACTCATGCTGGTGGCATTGT77ACAGC7AG;iJ^.^.G 

2710 2730 2750 

C7GCACTGG7GC7A-a,7GCCCCTTGGGA;vA7GGGGCTGTGAGGAGGAGGA7TA7/..kCTTAG 

2770 2790 2010 

CCC7AGCC7CTT77A--.CAGCC7CTGAAAT77ATC77TTCTTCTATGCGGTC7ATAAATGT 

2830 2D50 2870 

' • • • ♦ 

ATCTTATAATA/>>vAAGGAAGCACAGCAGGAAGACAGGCAAA7GTACTTC7CACCCAG*rCT 
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2590 2S10 2930 

7CTACACACATGCAATCTCTTTCCGCCTAAGAC;j^GG7TTTATTCTATATTGC7TACCT 

5 

2S50 2970 2990 

CATCTCATCTTAGGCCT^J\GACGCTTTCTCCAGGAGGA7TAGCTrGGAGTTC7CTATACT 

70 

3010 3030 3050 

CAGG7ACC7C777CAGGG7777C7AACCC7GACACGGAC7G7GCA7AC7r7CCC7CA7CC 

75 

30*0 3090 3110 

A7GC7G7GC7G7G77AT77AA77777CC7GGCTAAGATCATGTCTGA7%T7A7G7A7GAyAA 

20 

3130 3150 3170 

A7TA77C7A7G77777ATAA7AJLAAA7AA7A7ATCAGACATCGAAA;vAjy\ykAA , 

25 
30 
35 
40 
45 
50 
55 
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(2) 

10 30 50 

CACCCG7GCTCGAAGCG77CCTGGAGCCCAAGCTCTCCTCCACAGGTGAAGACAGGGCCA 

70 SO 110 

GCAGGAGACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
rte tGl vHi sLeuSc rAlaProLeuHi sArgVelArgValProTrpGln 



130 150 no 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuLeuTh rAl eSe r LeuLeuThcPheTrpAsnProProTh rThr Al aGlnLeu 



190 210 230 

ACTACTGAATCCATGCCATTC.^-ATGTTGCAGAGGGGAAGGAGGTTCTTCTfCTTGTCCAC 
ThrThrGlu5e rne tProPheAsnValAlaGruGlyLysGluValLeuLeuLeuVelHi s 



250 270 290 

- 

A^7CTGCCCCAGCAACTTTTTGGCTACAGCTGGTACA-*L^GGGGAA,AGAG7GGATGGCXAC 
30 AsnLeu?r:oGlnGlnLeu?heGly7y r Sc r7rp7y r LysGlyGluAr gVaiAspGi vAsn 



310 330 350 

35 CGTCA^.-.7TG7AGGATATGCAA7AGGA^AC7CAJi.CA-AGC7ACCCCAGGGCCCGCAJiJ^CAGC 
ArcGlnl leva lGly7yrAl all eGly7hrGlnGlnAla7h rProGlyPrcAlaAsr.se r 



370 390 410 

40 ' 

GG7CGAGAGACAA7A7ACCCCAA7GCA7CCC7GC7GA7CCAGAACG7CACCCAG^J^.7GAC 
GlyArgGlu7hr I le7yrProAsnAlaSerLeuLeuIlcGlnAsnVal7hrGlnAsrAsp 
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43D 470 

, - " " 

ACAGCATTCTACACCCTAC?iAGTCATAAACTCACATCTTCTGAATGAACAAGCAAC7GCA 
ThrClyPheTyiThrLe-jGlnval lleLysSerAspLeuVelAsnGluGluAle-hrCly 

<cc 510 530 

CAGTTCCA-GTATACCCGGACCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
GlnPheHi sValTyrProGluLeuProLysPtoSerllcSerSe rAsnAsnSe rAsnP r o 



55C 570 590 

GTGGAGCACX^GGA-GCTGTGGCCTTCACCTGTGAACCTGACACTCACGACACAACCTAC 
valGluAsp'-ysAspAlavalAlaPheThrCysCluFroCluThrGlnAspThrThrTyr 



610 630 650 

CTGTGGTGGATAAACAATCACAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCA.\TGGC 
LeuTrpTrp-ieAsnAsnGlnSerLeuPtoVelScrProArgLeuGlnLeuSerAsnGly 



670 6S0 710 

, , • • • ■ 

AJ\CAGGACCCTCAC7C7ACTCAGTGTCACAAGGAATGACACAGGACCCTA7GAGTGTG.SwA 
AsnAr57hrL£j7hrLeuLeuSetVal7hrAr9AsnAspThrGly?roTyrGloCysGlu 



730 "50 no 

A7ACAGAACCCAGTGACTGCCAACCCCAG7GACCCAGTCACCTTGAA7GTCACC7A7GGC 
IleClnAsnProValSerAlaAsnArgSerAspProValThrLeuAsnvaiThrTyrGly 



7S0 810 B30 

CCCCACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACC7CACC 
ProAspThrProThrlleSerProSerAspThrTyrTyrArgProGlyAldAsnLcuSe r 
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550 B70 890 

CTCTCC7GC?A?CCACCCTCTAACCCACCTCCACAC7ACTCC7CGC7TATCA^7GC-.-,s>CA 
LeuSe'rCy sTy rAleAlftSe rArnfro^roAleClnTyrSer-TrpLeul le.*r&nG-y7hr 



S:D 930 9S0 

T7CCAGCA.i.^GCACACrJ^GAGC7C7T7ATCCCT;'-S<CA7CAC7G7GAA7A^.7AGTGGA7CC 
PheGlnGlnSe r7hrGlnGluLeuPhellePtoAsnl leThcvalAsnAsnSe rGlySe r 



970 990 1010 

. • • • • • 

TATACCTGCCACGCCPcATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty rThrCysHi sAi aAsnAsnSe rvalThrGlyCy sAsnArgThr Th r val Ly sTh r 1 1 e 



1030 1050- 1070 

20 ATAGTCACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAA-ATCAAAGCCAGCAAGACCACA 
IleValThrGiuLeuSerProValValAlaLysProGlnlleLysAlaSerLysThrThr 



ICSC 1110 1130 

GTCACAGGAGATAAGGACTCTGTGAA.CCTGACC7GCTCCACAAATGACACTGGAA7CTCC 
ValThrGiyAspLysAspSc rValAsnLeuThrCysSerThrAsnAspThrGlylleSec 

30 liSO 1170 1190 

• • • ■ • • • 

ATCCGTTGGTTCTTCAAAAACCAGAGTCTCCCGTCCTCGGAGAGGATGAAGCTGTCCCAG 
I IcArgTrpPhcPhcLysAsnGlnSe rLeuProSe rSe rCluArgKe tLysLeuSe rGln 



1210 1230 1250 

GGCAACACCACCCTCAGCATAAACCCTGTCAAGAGGGAGGATGCTGGGACGTATTGGTGT 
GlyAsnThrThrLeuSe r 1 1 eAsnProVa 1 LysArgGluAspAlaGlyThrTy rTr pCy s 
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;21C- 1290 1310 

CACG7CrTC.^.-.CCCAATCAGTA^GAACCAAAGCGACCCCATCATGCTG;^^CG7AAACTAT 
GluVftl pheAsnProI 1 e Se r Ly sAsnGl nSe rAso? r ol 1 enc tLcuAsnVelAsnTyr 

5 

123: 1350 1370 

A.nTGCTCTACCAC.^AG.^^^^TGGCCTCTCACCTGGGGCCATTGCTGGCATTGTGATTGGA 
10 AsnAleLeu? :oGlnGluAsnGlyLcuSe rProGlyAlal 1 eAl eGl y 1 1 e Val 1 1 eGly 

1390 1410 1430 

... 

75 G7AGTGCCCCTGG7TGCTCTCATAGCAGTAGCCCTGGCATGTTTTCTGCATT7CGGGAAG 
ValValAleLcuValAleLeuIleAlaValAldLcuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

20 ' • 

ACCGGCAGC7CAGGACCACTCCAA7GACCCACC7AACAAGATGAATG;^J^G77AC77AT7C 
7hrGly5e r Se rGlyProLeuGln 

25 1530 1550 

7ACCC7GAAC777GA.nGCCCAGCAACCCACACAACCA-^CT7CAGCCiCCCCA7CCCTAAC 

3^ 1570 1590 1610 

AGCCACAG.2^-JtT.=^^.TT7A7TCAGAAGTAAAAAAGCAG7AATGAJ^JVCCTG^--^^^^ 

1630 
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(3) 



10 30 50 

CACCCC':CCTCG,a.2.GCGTTCCTCGACCCCAACC7CTCCTCCACACGTCAJKCACAGGGCCA 

-0 90 no 

GCAGCACACACCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCC7GGCAC 
ric tGlyni sLcuScrAldProLcuHisArgValArgval ProTrpG) n 

130 150 170 

GGGC7TCTGCTCACAGCCTCACT7CTAACCTTCTGGAACCCCCCCACCACTGCCCAGCTC 
GlyLeuLeuLcuThrAl aSc rLeuLeuThrPhcTrpA5n?roProThr7hrAlaClnLeu 

20 

190 210 230 



25 



AC7AC7GAATCCA7GCCA7TCXATG7TGCAtJAGGGGAAGGAGG7TC7TCTCC7TGTCCAC 
7hr7h rGluSe r Me t ? r o?hc AsnVa 1 Ai aGl uG I y Ly sGluVa 1 LeuLeuLe uVa i H i s 

250 270 290 

AA7CTGCCCCAGCA.-X777TTGGCTACAGCTGGTACAXAGGGGAAAGAGTGGA7GGC/\.^.C 
30 AsnLeu?rcGlnGlr.L5u?heGly7y rSc r7rpTyrLysClyCluArcValAs?GlyAsr. 

Si:- 330 350 

35 CG7CAjyjk77G7AGGA7A7GCAJ^7AGCAAC7CAACPJiGC7ACCCCAGGGCCCGC/VVi'.CAGC 
Ar^Glnl ieVaiGiy7y rAlai 1 cGly7hrGlnGlnAlaThr?roGly?roAl aAsnSs r 

370 390 <10 

40 

GG7CGAGAGACA^7A7ACCCCAA7GCA7CCC7GC7GA7CCAGAACG7CACCCAG;v-.TGAC 
GlyAr5Clu7hf 2lc7y rProAsnAlaSc rLcuLcuI leClnAsnVal7hcGlnAsr.Asp 



45 



430 450 470 

ACAGGA77CTACACCCTACAA.GTCATAAAG7CACATC rTCTGAA7GAACAAGCAACTGGA 
7hrClyPhc7yrThcLeuGlnValIleLysSerAspLeuValAsnGluGluAla7hrGly 
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<oo SIO 530 

CACTTCCATCTATftCCCCCAGCTCCCCAACCCCTCCATCTCCACCAACAACTCCAACCCr 
ClnPheHi sVilTyrProCluLeuProLysProSe rlleSerSerAsnAsnSe rAsnP'ro' 

SSO STO 590 

GTGGAGCACA.^GGATCCTG7GCCCTTCACCTGTGA.:^CCTGAGAC7CAGGACAC;v^CCTAC 
vaiGluAspLysAsoAlaValAlePheThrCysCluPcoGluThrGlnAspThrThrTy r 

610 630 650 

CTGTGCTCCATXAACAJ^TCAG.^GCCTCCCGGTCAGTCCCACCCTGCACCTGTCCAJVTGGC 
LeuTrpTrpI i eAsnAsnGlnSe r LeuP roVelSc rProAcoLcuGlnLeuSc rAsnGly 

6"70 690 ^ 710 

AACAGGACCCTCACTCTACTCAGTGTCAC/KAGGAATGACACAGGACCCTATCAGTGTGA^i. 

Asr.ArgThrLfiuThrLeuLcuSc rvalThrArgAsnAspThrGlyProTy rGluCysGlu 

130 750 770 

, • • • • * " 

ATACAG.^uACCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAJ^TGTCACCTATGGC 
1 leGlnAsnProValScrAlaAsnArgSerAspProValThrLeuAsnvalThrTyrCly 



■790 310 530 

CCGGACACCCCCACCATTTCCCCTTCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
-rcASDTh:?:oThr-2 lr5« r Pr crSe rAspThcTy rTy rAr gProGiyAl&AsnLeuSiX-, 



£5: 870 E?0 

:7CTCCTGC:ArGCAGCCTCT>-ACCCACCTGCACAGTACTCCTGGCTTATCr^^TGGA«\CA 
.euSe r CvsTvrAlaAldSe r AsnProProAlaGlnTy r Se rTrpLeuI 1 cAsnGlyThr 



910 930 950 

TTCCAGCAA.ivGCACACAA.GACCTCTTTATCCCTAACATCACTGTGAATAATAGrGGATCC 
rheGlnGlnSerThrClnCluLeurhellcProAsnllcThrValAsnAsnScrGlyScr 

970 990 lOlO 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
TycThcCysKlsAlaAsnAsnScrValThcGIyCysAsnAcgThcThcValLysThcllc 
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1030 1050 1070 

A7ACTCACTCAT>.-.?GCTCTACCAC;^^G/^JVTGCCCTCTCACCTGCCCCCATTGCT^ 
5 : i e v> 1 7h r.-.s pAsr.Al eLeuP rod nGluAsnGl yLeuSe r ? r oGl yAl a 2 1 e Al eGl y 

1090 1110 1130 

ATTGTGATTCGACTACTGGCCCTGGTTGCTCTGATAGCAGTACCCCTGGCATGTTTTCTG 
'° ilevalil eGlyvalvalAlaLeuValAlaLeuI leAl aval Ala LeuAlaCysPhe Leu 

iiSO ll"?© 1190 

^5 CATTTCGGCAAGACCGCCACCTCACGACCACTCCAATGACCCACCTAACAAGATGAATCA 
Hi sPhcGlyLysThrClySe rSe rClyProLcuGln 

1210 1230 1250 

20 ...... 

AGTTACTTATTCTACCCTGAACTTTCAAGCC^AGCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

CCCATCCCT^.i%C.AGCCACAGAAATAJKTTTATTCACA-^GTAAAAJU^GCAGTAJ>^TCAAACCT 

1330 

30 GA^^JJ%AJ^J^2uKAJ\AAAA 

35 

40 

45 

50 
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(4) 



ecegcacagc w9aca9cc9tactcag9aagc ttc tcoatcctaggccta cctccace^eg 

oagaacacaceaccegcagagacca tggggcccctc tcagcccctccc tccacececc tc 

Mc tGlyProLeuSe rAieProProCy sThrHi sLeu 

a t cecttggaagcggotcctgctcacagcatcacttttaaacttctggaatccgcccaca. 
i leThrTr pLysGlyvalLeuLeuThrAlaSerLeuLeuAsnPheTrpAsnProProThr 

actccccaaqtcaccattgaaccccagccacccaaeotttctgagggcaogcatcttctt 
ThrAlaGlnValThrlleCluAlaGlnProProLysValSerGluClyLysAspvalLeu 

ctacttgtccacaatttgccccagaatcttgctggctacatttggtacaaagggcaaatg 
LeuLeuValKisAsnLcuProGlnAsnLcuAlaGiyTyrl leTrpTyrLysGlyGlnKct 

acatacgtctaccattacattacatcatatgtagtagacggtcaaagaattata ta tggg 
ThrTyrValTyrHisTyrllcThrSerTyrValValAspGlyGlnArgllcIleTyrGly 

cctgcatacactcgaagagaaagegtatattcceatgcatccctgctgatccagaatgtc 
ProAlaTy r Se cGlyArgGluArgValTy rSerAsnAlaSerLcuLeuI leGlnAsnVal 

acgcaggaggatgcaggatcc tacaccttacaxratca taaagcgacgcgatgggactgga 
ThrGlnGluAspAlaGlySerTyrThrLcuHisIlcIleLysArgArgAspGlyThrGry 

ggag::aactgcacatttcaccttcaccttacacctgcagactcccaagccctccatctcc 
GlyValThrGlyHisPheThrPheThrLeuHisLeuGluThrProLysProSerlieSer 

agcaecaacttaaatcccaggcaggccatggaggctgtgetctteacctgtgatcctgcg 
SerSerAsnLeuAsnProArcGluAlanetGluAlavelilcLcuThcCysAspPrcAia 

actccaccc5cac5Ctaccagtg5t.9cat9aatcctca9agcctccctatgactcdcecc 
Thr?roAlaAlaScr7yrGlnTrpTtpKctAsnGlyGlnSerLeu?roMetThrHisArg 

ttcccgctgtccaaeaccaacacgaccctctttatatttgctgtcacaaegtatcttcca 
LeuGlnLeuSe rLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAla 

gccccctatgaatgtgaaatacgcaacccagtgactgccacccgcagtgacccactcacc 
Gly?roTycGluCysGluIleA.rQAsnProValS8rAlaSerAcgSecAspProVal7hr 

ctceatctcctcccaaagctgtccaacccctacetcacaatcaacaacttaaecccceca 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnPrcArg 

gacaataaggatgtcttaaccttcacctgtgaacctaagagtgagaactacacctacett 
GluAsntysAspValLeuThcPhcThrCysGluProLysSerGluAsnTyrThrTyr 1 1 e 

tggtggctaaatggtcagagcctccctgtcagtcccagggtaaagcgaccca ttgaaaac 
TrpTrpLeuAsnGlyGlnSertcuProValScrProArgValLysArgProI leGluAsn 

ag^aucctcattctacccaatgtcacgagaeatgaaacaggaccttatcoatgtgcaate 
ArgileLeuI leLcu? roAsnVe iTh r Ar gAsnGluTh cGlyP roTy rGl nCy sGlu I le 

cgggaccgata tggtggca tccgcegtgacccagtcaccctgaatgtcctcta tcgtcca 
ArgAspArgTyrGlyGlyl leArgSerAspProValThrLeuAsnValLeuTyrGlyPro 
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1081 gacctccccegcdtttacccttcdttcacctdttaccgttcaggagaaaacctccacttt 1-40 
AspLcuProScrileTyrProSerPhcThrTyrTyrArgSerClyGluAsnLeuTyrPhe 

IKl tcctccttccctgagtc taacccacgggcacaatattcttggacaa t taatgggaagttt 12C0 
Se rCysPheGlyGluScrAsnProArgAlaGlnTyrSerTrpThrlleAsnGlvLysPhe 

1201 cacctetcacgecaaaagctctctdtcccccaaatdectacaddgcatagtgggctctat ' 2-"• 
GlnLeuSe rGlyGlnLysLcuSe r lleProGlnl IcThrThrLysHi sSe rGlyLeuTy r 

1261 gcttgctctcttcctaactcagccactggcaaggaaagctccaaatccatcacagtcaaa ^ ^2• 
Al aCysSe r valArgAsnSe rAlaThrGlyLysGluSe rSerLysSe r I leThrValLys 

1321 gtctctgactggatattaccctgaattctactagttcctccaattccattttctcccato 1350 
valScrAspTrpIleLcuProEnd 



IJ81 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 14 40 

1441 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

1501 aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

1561 acttcacactgtggacagtttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaatttcacatgtagtagcttctgagggtaacaacagagtgtcagattftgtcatctca 1740 

1741 acctcaaacttttacgtaacatctcagggaaatgtggctctctccatcttgcatacaggg 18 00 

1801 ctcccaatagaaatgaacacagagatattgcctgtgtgtttgcagagaagatggtttcta i860 

1861 taaagagtaggaaagctgaaattatagtagagtctcctttaaatgcacattgtgtggatg 1920 

.11} 9^^^^""^"tc:ctaagagatacagtgtaaa«acgtgacagtaatactcattctacca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa 2010 
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(5) 



10 



75 



1 gggtggatccteggctcetctccataggggagaacacacatacagcagagaccatggga 59 

Me tGly 

50 cccctctcaccccctccctgcactcagcacatcacctggaaggggctcctgctcacagca 119 
ProLeuSerAld?ro?roCysThrGlnHislleThrTrpLysGlyLeuLeuLeuThrAla 

120 tcac ttttaeecttctggaacctgcccaccactgccceagtaataattgaagcccagcca 17 S 
ScrLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSerGluGlyLysAspValLeuLcuLeuValHisAsnLeuProGlnAsnLeu 

24 0 actggctacatctgg vacaaagggcaaatgacggacctctaccattacattacatcatat 299 
ThrGlyTyrlleTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

3o0 gtagtagacggtcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 



valValAspGlyGlnllelleTyrGlyProAlaTyrScrGlyArgGluThrValTyrSer 



20 



25 



30 



35 



40 



45 



360 aatgcatccctgctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 
AsnAlaSerLeuLeuIleGlnAsnValThrGlnGluAspAlaGlySerTyrThrLeuHis 

4 20 atcataaagcgaggcgatgggactggaggagt'aactggatatttcactgtcaccttatac 4 7 9 
llelleLysArcGlyAsoGlyThrGlyGlyValThrGlyTyrPheThrVelThrLeuTyr 

4 80 tcggagactcccaagcgctccatctccagcagcaecttaaaccccagggaggtcatggag 539 
SerGluThrPrcLysArgScrlleSerSerSerAsnLeuAsnProArgGluValwetGlu 

54 0 octctgcgcttcctctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 599 
AlaValAr gLeuIleCysAspProGluThrPtoAspAlaSerTy rLeuTrpLeuLeuAsn 

6 00 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccctctat 6 59 

GlyGlnAsnLeuProMetThrKisArgLeuClnLeuSerLysThrAsnArgThrLeuTyr 

660 ctatttggtgtcacaaagtatattgcagggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrlleAlaGlyProTyrGluCysGlulleArgArgGlyVal 

7 20 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 

SerAlaSerArcSerAspProValThrLeuAsnLeuLeuProLysLeuProMetProTyr 

7 80 atcaccatcaacaacttaaaccccagggagaagaaggatgtgttagccttcacctgtgaa 8 39 
IleThrlleAsnAsnLeuAsnProArgGluLysLysAspValLeuAlaPheThcCysGlu 

34 0 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 
ProLysSerArgAsnTyrThrTyrllcTrpTrpLeuAsnGlyGlnSerLeuProvalSer 

900 ccgaggg taaagcgacccattgaaaacdggatactcattctacccagtgtcacgagaaat 9 59 
ProArgValLysArgProlleGluAsnArgi leLeuI leLeuProScrValThrArgAsn 

960 gaaacaggaccctatcaatgtgaaa tacgggaccgatatggtggcatccgcagtaaccca 1019 
GluThrGlyProTyrGlnCysGluIleArgAspArgTyrGlyGlylleArgSerAsnPro 
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1020 gtcaccctgaatctcctctdtggtccagacctccccagaatttacccttacttcecctat 1079 
valThrLeuAsnvalLeuTyrGlyProAspLeuProArglleTyrProTyrPheThrTy r 

1080 taccgttcagcagaaaacctcgacttgtcctgctttgcggactctaacccaccgocagag 1139 
5 TyrArgSerGlyGluAsnLeuAspLeuSerCysPheAlaAspSerAsnProProAlaGlu 

1140 tatttttggacaattaatgggaagtttcagcta tcaggacaaaagctctttatcccccaa 1199 
Tyr?heTrpTh:IleAsnGlyLys?heGlnLeuSerGlyGlnLysLeuPheI leProGln 



70 



75 



1200 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 1259 
IleThrThrAsnHisSerGlyLeuTyrAlaCysSerValArgAsnSerAlaThrGlyLys 

1260 gaaatctccaaatccatgatagtcaaagtctct^gtccctgccatggaaaccagacagag 1319 
GluIleSerLysSerMetlleValLysValScrGlyProCysHisGlyAsnGlnThrGlu 

1320 tctcattaatggctgccacaetagagacactgagaaaaagaacaggttgataccttcatg 1379 
SerKisEnd 

13 80 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataetcaaaaggataa 14 39 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgttttattctccagatt .1499 
1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1559 
1560 tttgcttttgctctctatctgagtgccccccc 1591 



20 



2. Replizierbares rekombinantes Kloniervehikel mit einem eine Nucleinsaure nach Anspruch 1 umfassen- 
den Insert. 

25 3- Zelle. die mit einem rekombinanten Kloniervehikel nach Anspruch 2 transfizlert. infiziert Oder injizlert ist. 

4. Verfahren zur Herstellung eines Polypeptids, umfassend die Schritte 

(a) des Kultivierens der Zelle nach Anspruch 3. 

(b) des Gewinnens des durch diese Zelle exprlmierten Polypeptids. 

5. Verfahren zur Herstellung eines gegen ein Polypeptid gerichteten Antikorpers, umfassend die Schritte 

(a) des Herstellens des Polypeptids durch das Verfahren des Anspruchs 4, 

(b) des Injizierens des Polypeptids in einen Wirt, der zur Bildung von Antikorpern befShlgt ist, und 

(c) des Gewinnens der Antikorper. 

Revendications 

1. Acide nuclSique comprenant une sequence de bases qui code pour une sequence peptidique, 
caracterise en ce que le groupe d'acides nucleiques est de TADN choisi parmi le groupe de cinq 
40 sequences ci-apres : 
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10 30 SO 

CACCCGTCC rCGAACCCTTCCTCGACCCCAAGCTCTCCTCCACAGGTGAACACACCCCCA 

5 

70 90 110 

GCAGGAGAC.-.CCATGGGGCACCTCTCAGCCCCACTTCACAGAGTGCGTGTACCCTGGCAG 
JO .ne cGlyHisLcuSc cAlaProLeuHi sArgValAryValProTcpGlr. 

120 150 110 

GCCCTTC7GCTCACACCCTCACT7CTA.-vCCTrCTCCAACCCCCCCACCACTGCCCAGC-I C 
G 1 vL*-j:.e jLeuTr. : A i £ Se r LeuLeuTh : PheT r pAs n ? c c? coTh r Th r A 1 c.G i i". L 2 

19C 210 230 

20 ACTACTGAATCCATGCCATTCA-ATGT'rCCACAGCGGAAGGAGGTTCTTCTCCTrCrCCA: 
Thc7h:Clu£e rrie tProPhe AS rival AlfeGluGl vLysGluVal Leu Leu LeuVelHi s 

2S0 270 290 

AATCTGCCCCAGC.^^^CTTTTTGGCTACAGCTGGTACA.'^^^GGGGA^AGAGTGGATGGCA.AC 
AsnLeu?rcGlr.GinLeu?>.eGlyTy rSe r7:c7y cLysGlyGluArcjVaiAsoGl yAs- 



jiu 



330 350 



CGTC/.A.^7TGTACGATATGCA-ATAGCAJvCTC;vAXAJ^.CCTACCCCACGGCCCGC;>uAACAGC 
ArcGinI le valGlyTy rAla i ieClyThi GlnGlnAlaThf PccClyProAlaASfiSe r 

370 390 <10 

35 . . . - • 

GGTCGAGAGACA.nTATACCGCAATCCATCCGTGCTCATCCACA;».CGTCACCCAGAATGAC 
ClyArgCluThc I 1 cTy r P coAsnAl oSe r LeuL«rul 1 cClnAsnVa 1 Th rCl nAsnAsp 

40 ^30 4S0 470 

ACAGGATTCTACACCCTACAAGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGA 
ThcClyPhcTy cThcLeuGlnVal 1 1 eLysSe c As pLeu va 1 AsnCl uG 1 uAl aTh r G 1 y 

45 
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c?: 510 520 

CACTTCCATC7A7ACCCCCACCTCCCC>JKGCCCTCCATCrCCACC/vACA.ACTCC.\.=vCCCT 
ClriPheH: svi ITy : ? :DGl-jLe'j?t'oLy s ?roSe r 1 1 eSe c Se : AsnAsnSe r Asn? : o 

::: 570 59C 

GTGCACCAC.-^V^CA7CCrCTCCCCT:CACCTG7CAACCTGACACTCAGCACACAACC:AC 
va IGluAspl/sAspAl aVaiAldPheThrCysCluProCluThrGlriAspThrThrTy r 

5iC 530 650 

75 CTCTCC7GCA7A.^v.AC.-J5.7CAGAGCC7CCCGC7CACTCCCAGGCTGCAGC7G7CC;x.'\7GGC 
LeuTf pTrpIleA.sr.AsnGlnSe r LeuProValSe cProAr^LeuGlnLeuSe c AsnGl y 



10 



20 



25 



30 



35 



40 



45 



57: 590 110 

AiACACGACCC7CAC7C7ACTCAGTG7CACAAGGAATCACACACGACCC'nvTCAGTCTGAA 
AsnAr97>. rL2u7h:Lev;I.euSe r va iThrAc gAcnAspTh cGly? coTy rCluCy sClu 

73: '^50 770 

A7ACAG;'J;CCCAG7GAG7GCGA-ACCGCAGTGACCCAG7CACCT7GA-\TGTCACCTA7CGC 
! 1 eGlnAs r. ? :cv» 1 Se rAlaAsnAc cSe r AspProVaiThr LeuA.snVcl7hrTy rCl y 



79: 010 830 

CCGGACACC::CACCA7T7CCCC7TCACACACC7AT7ACCCTCCAGGCGCAAACC7CAG: 
PrcAsp7"r;r?:o7hr : leSerProScrAspThtTyrTyrAc gProGlyAlaAsnLeuSer 

SS: 870 890 

CTCTCCTCC7ATGCAGCCTC7A^ACCCACCTCCACAGTACTCCTCCCTTATCAArCCA»ACA 
LeuSerCys7yrAleA.ldSe rAsnProProAlaClnTy rSe cTsrpLeuI 1 eAsnGlyThr 

9i: 930 950 

T7CCAGCA--.'■GCACAC>-^GAGC7CT77ATCCC7A.^CA7CAC7G7C^AT/i^^TAG7GCA7CC 
?heGlnGLnSer7hrGlnQluLeu?heIleP:oAsnIleThcVaiAsr.AsnSe:Gly5er 

97 G 9 90 ,1010 

• . . . . • 

TATACCTCCCACCCCAATA^CTCACTCACT'JCC TCCAACACCACCACACTCAACACGArC 
TycThrCysH; sAlAAsnAsnSe cValThcGlyCysAsnAcgThcThcVal Ly sTMi* lie 
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5 



1030 lOSO 1070 

ATACTCaCTCATAJ^TCCTCTACCACAAG.kAAATCCCCICTCACCTCCCCCCATIXCI 

I leVdl7^:AspAsnAlaLeu?roClnCluAsr.ClyLeuSec?coClyA.lA2 le;-.lACly 



1110 



AT7CTCAT7CCACTAGTCCCCCTCCT7CCTCTCATACCAGTACCCCTCGCATGTTT7CTC 
I level 1 1 5GlyvaIVdiAiatcuValAlaLeuIleAlcVclAlaLeuAlsCys?hfe*-e j 

lis: 1170 i i ? 0 

75 CA777CGGG.--^GACCCGCAGGGCA.\GCCACCAGCGTCATCTCACACAGCACAAACCC7CA 
H i sPheGlyLy s7hrCiyAr gAldSe cAspClnArgAspLeuThcCluKi sLysPf oSe r 

121C 1230 1250 

20 . . . . • 

G7C7CCAACCACACTCACGACCACTCCAATCACCCACCTAACAAGATCAATCAAGTTAC7 
Vd 1 Sc c AsnH i sTricClriAspHi s Sc c AsnAspP r oP r oAsnLy stte tAsnGluValTh : 

127-: 1290 1 310 

25 

rA77CTACCC7GAAC777G;>J^GCCCAGCA>i.CCCACACAACCAAC77CAGCC7CCCCATCC 
Ty rSc rTh:L2uAsn?hcCluAieClnCln?coThcGln?coThcSc rAlaSc r ?roSc c 

30 133 : 1 350 1370 

C7a.acagccacacaaataatttattcagaj;gta>.^aaagcagtaatga.\.acctgtcctgc 

L.cuTh:Ala7h:GluI lelleTy cSc rCiuVelLysLysGin 

139C KIO K30 

• • * * ' . * 

TCACTGCAG7GCTGATCTAT7TC/VvGTCTC7CACCC7CATCACTAGGAGA7TCCTt7CCC 

1450 1470 1490 

C7G7AGGG7AGAGGGG7CCGGACAGAAACAAC777C7CCTAC7C77CC77CC7AA7AGGC 

-^5 1 5 1 0 1 5 30 ISSO 

ATC7CCAGGC7GCC7GGrCAC7GCCCCrCTC7CAGTG7C/vATAGA7GAAAGTACA77GGG 



40 



50 



1570 1S90 1610 

AG7CTGTAGG/w\ACCCAACCT7C77G7CA77GAJWVTTrCCCAAAGC7CAC7TTCGGAAAC 



55 
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152: 1650 leio 

ACGGACCAC-i.-.CTTCCCCTCCCTTCCCCTTTTCCCAACCTCCACTTCTTTTiVtACTT3CC 

5 

169: mo 1730 

TCTTCAGACCACTCATTCCTTCCCACCCCCACTCCTCTCCrATCACTCT.AATrCCCATTT 

10 

1750 1770 1790 

CCCATAGCCTTCACG7TATCTCCTTTTCCATTAACTACA7GTCCCAGGAAACACCCAGAC 

75 

laiO 1830 1850 

ACAGA.V\GT.y'-^CGCCAGTAJ^TGC7TCTCCTATTTC7CC;^JkACCCTTGTGTCAACTACCA 

20 

137G 1890 1910 

AACAGAAGA.\AA7CA.\.ATATATAACCAATAGTCAAATCCCACACGTTTGTCCACTGTCAG 

25 

1930 1950 1970 

GGTTGTCTACC7GTAGGA7CAGGC7CTA.\GCACC77GG7GCT7AGCTAGAATACCACCTA 

1990 2010 2030 

A7CCTTC7GCC.-J^CCCrC7CTTCAGAGAACCCAC7AGAACCAACTAGCAAAAA7CACTTG 

2050 2070 2090 

CCAAAA7CCAACCC^JVTTCCTCATCGAAAATCC;xAAACCACATATA7CT7TTAA7A7CT7 

2110 2130 2150 

7A7GGGC7C7G77CAAGGCACTCC TCACACGCAGCCCTTATACCTTCAGGACGGAACCAG 



^® 2170 2190 2210 

CTTCTCAT>^qAC/^J^7CTGCTAGGAACTTCCCAAACCAATCAGACACC7CCCCTTCAGC 
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223: 2250 2110 

CATTAT7T>^^TTGTT.vAGPun7ACAC/\ATTTGGCCTA7TCGCAT7T7TCTCCTT7TCTC 

5 

:::: :3'io 2220 

TG A C A C A 7 T 2 C A C C A 7 7 7 7."-'^ 7 7777G 7.-J^.C 7G C T7 A777 A7C 7GAA^-AG G G7 7 A777 7 7 

70 

235: 2270 2390 

AC77AGC77AGC7A7G7CAGCCAA7CCGATTGCC7TAGC7GAAAGAAACCACCGAA;x7CC 

15 

2410 2'»30 • 2450 

C7CAGG7CCC77GG7CAGGAGCCTCTCAAGATT77TTTTGTCAGAGGCTCCA^ATAGA.\A 

20 

2470 2490 2510- 

. . . . • • 

ATAAGA-^AACCiTTTCT7CA77CA7CCC7AGACCTACATTT.\ACTCAC7T7CTACCCACC 

25 30 2550 2570 

. • • • • • 

7CACACCAA7CA7C.^AC7ACCAT7C7ATTCCATCTT7CCACC7CTGCA7T77CTC7T7CC 

^ 25S0 2610 2630 

CCCCA77CAC777C7CAGGAAACC7TCGCC7C7GCTAAGG7G7AT77CC7CCTTCACAAC 

^ 2650 2670 259C 

7GGGAGCACCC7ACAGGGACAC7A7CACTCATGCTGGTGCCATTG777ACACC7AGr-^AG 

2710 2730 2750 

C7GCACTGG7GC7A_\7GCCCC7TCCGA;\A7GGGCC7GTCACCACGACCA77A7/JsC7TAG 

45 2770 2790 2310 

CCC7;-.CCCTCTT7rAJi.CAGCCTCrCAAAT7TATCT7TTCTTCTATCCCGTC7AT/vAA7CT 

SO 26 50 2050 2370 

J . • • • ' 

ATCTTATAATAy^AAGCAACCACAGGAGCAACACACCCAAA7CTACTTC7CACCCAG7CT 
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:5;0 2910 2930 

TCTACACACArCGAATCTCTTTCGGCCTAACACAAACCTTTTATTCTATATTCfTTACCT 

5 

2930 2970 2990 

CATCTCATGTTACGCCT.2^JVCAGCCTTTCTCCACGAGCA7TACCTTGCAGTTCTCTATACT 

10 

3010 3030 3050 

CAGCTACCTCTTTCAGGCTTTTCTAACCCTGACACCGACTGTGCATACTTTCCCTCATCC 

75 

307: 3090 3110 

ATGCTGTGCTGTGTTATTTAATTTTTCCTGGCTAAGATCATGTCTGA/tTTATGTATGAAA 

20 

3130 3150 3170 

ATTATTCTATGTTTTTATAATAAAAATAATATATCAGACATCCAAA/vA-AAAAA , 

25 
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(2) 



i: 30 50 

* " • . 

CACCCCTCCT-GAAGCGTTCCTCCACCCCAACCTCTCCTCCACACCTGAAGACAGCCCCA 

SO 110 

CCAGGAGACA:CA7GGGGCACCTCTCAGCCCCACTTCACACAGTGCCTGTACCCTCGCAG 
netGlvHisLeuSerAUProLeuHisAcgvalAf gValProTrpCin 

12: 150 170 

GGGCT7C7CCTCACACCC7CACTTCTAACCTTCTGGAACCCGCCCACCACTGCCCAGCTC 
GlyLeuLeuL.5j7hrAlaSerLeuLeuThcPheTrpAsnPcoProThfThrAlaGlnLeu 



190 210 i30 

• * • • * 

AC7AC7GAA7CCArGCCA77CAA7GTTGCACAGGCGAACGAGGTTCT7CTCCTTG7CCAC 
ThrThrGluSsrnetProPheAsnValAlaGluGlyLysCluValLeuLeuLeuVdlHis 



233 270 290 

AJV7C-GCCCCAGC.VACT7TT7GGC7ACAGCTGGTACAAAGCGCAAAGAC7GGATCGC.^.^C 
Asr.LeuPrcClr.GlnLsuPheClyTyrSecTcpTyrLysGlyCluArgVaiAspGlyAsn 



312 330 350 

• . , ^ 

CG7C.-_i_i.7TGTAGGA7A7GCAATAGCAACTCAACAAGCTACCCCAGGGCCCGCAAACAGC 
ArcGlnllsValGly7yrAlaIleGlyThrClnGlnAlaThr?roGly?coAlaAsnSer 



370 390 410 

GG7CGAGAGACAA7ATACCCCA-ATGCATCCCTCCTGA7CCAGAACCTCACCCAGAATGAC 
ClyAc5GluThrlleTycProAsnAlaSerLeuLeuIleClnAsnVal7hrGlnAsnAsp 
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ACACCA7TCTACACCCTAC.s^CTCA7AAAC7CACATCTTCTCAA7CAAGAAGCAACTCGA - . 

7h:Gly?heTy :Thr'-euGlrtVal lieLysSerAspLeuVaiAsnGluGluAiaThrGly 

<SC 510 530 

CAGT7CCA-CTATACCCGGAGC7GCCCAAGCCCTCCATCTCCAGCAACAACTCCAACCCT 
Glr.?heHisVdlTyr?rcGluLeuPcoLys?roSe rlleSecSe rAsnAsnSe rAsnPro 



530 570 590 

. • ■ • ' 

GTGGACGACX5.GGATCCTGTGGCC7TCACCTGTGAACCTCACACTCAGGACACAACCTAC 
valGluAspLy sAspAlaValAla?h8ThrCysGlu?coGluThrGlnAsoThcThrTy r 



6i0 630 650 

20 CTGTGGTGGATAAACAATCAGAGCCTCCCGCTCAGTCCCAGGCTGCAGCTGTCCAATGGC 
LeuTrpTrpIlcAsnAsnGlnSecLeuPcoValSecProAcgLeuClnLcuScrAsnCly 



670 690 710 

• ■ • • . • • 

AACAGGACCCTCAC7CTACTCAGTG7CACAAGGAATCACACACGACCCTATGAG7GTGAA 
AsnArgThr Lej7hrLeuLcuSecVdl7hrAC9AsnAspThrGly?f07yrGluCysGlu 



730 750 770 

• * • • • • 

A7ACAGAACCCAG7GAGTGCGAACCGCAG7GACCCAG7CACC7TGAATGTCACC7ATGGC 
I leGlnAsnProValSe rAlaAsnArgSerAsp?coValThrLeuAsnVai7hc7y rGly 



790 810 830 

• • • . • 

CCGGACACCCCCACCA77TCCCC77CAGACACC7A77ACCG7CCAGGGGCAAACC7CAGC 
ProAsp7hr ProThr i IcScrPcoSc cAspThr7y c7y cAr gProGlyAlaAsnLcuSe r 

40 
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S;.j B70 8S0 

ctctcc7c:tatgc.-.gcctcta.:;cccacctccacagtactcctggcttatcaatgg.--^ca 

LeuSe rCysT/rAliAlaSe rA-s-nPro?roAieCln7y cSerTrpLeuIleAs-nGlyT^r 

930 9S0 

ttccagc.-.--^gcacac;^j^gagctctttatccctaacatcactgtgaat;^atagtggatcc 

PheGlnClr.se rThrGl nGluLeuPhe I leProAsnllcThr valAsnAsnSe cGlySe r 



970 990 1010 

. • - • • 

TATACCTGCCACGCCAATAACTCAGTCACTGGCTGCAACAGGACCACAGTCAAGACGATC 
Ty cThrCysHisAlaAsnAsnSe cValThrGlyCysAsnArgThrThrValLysThr I Ic 



1030 1050 1070 

• • • • 

ATAG7CACTGAGCTAAGTCCAGTAGTAGCAAAGCCCCAAJVTCAAAGCCAGCAAGACCACA 
IleValThrGiuLeuSerProValValAlaLysPcoClnlleLysAlaSecLysThrThr 



1C9C 1110 1130 

• • • * • 

CTCACAGGAGATAAGCAC7C7GTGAACCTCACC7GCTCCACAAA7GACACTGGAA7CTCC 
val7hrGlyAs?LysAspSe rValAsnLeu7hcCysS€C7hcAsnAsp7hrGlyIleScr 



1150 1170 1190 

• • • . • • 

ATCCCTTGCTTCTTCAAAAACCACAGTCTCCCCTCCTCGGAGAGGATGAAGCTCTCCCAG 
IleAcgTrpPhePheLysAsnClnSecLeuPtoSecSecGluAcgttetLysLeuSerGln 



1210 1230 1250 

. . • • " " 

GGCAACACCACCC7CAGCA7AAACCC7GTCAACAGCGAGGA7CC7GGGACC7A77GG7G7 
GlyAsn7hr7hrLeuSecIlcAsnPcoValLysArgCluAspAldGly7hr7ycTcpCys 



An 



EP 0 346 710 B1 



s 



1290 1310 

.CCCAATCAGTAJkGAACCA/\ACCCACCCCATCATCCTCAACGTAAACTAT 
Gl-jVal?neA3-?roIleSerLvsAsnClnSerAsoProIler«ctLeuAsnValAsnTvr 



133: 1350 1370 

• • . . 

A.-.7GCTCTACCACA^G.i-fiukATGGCCTCTCACCTGGGCCCATTGCTGCCAT7GTGATTGGA 
Asr.AlaLe-j? roGinGluAsnGlyLeuSe cPcoGlyAlal 1 eAlaGly I leVal I leGly 

13S0 1410 1430 

C7AGTGCCCCTGG77CCTCTGATAGCAGTAGCCCTCGCATGTTTTCTGCATTTCGGGAAG 
ValValAlaLeuValAlaLeuIleAlaValAlaLeuAlaCysPheLeuHisPheGlyLys 

1450 1470 1490 

20 

ACCGGCAGC7CAGGACCACTCCAATCACCCACC7AACAAGA7GAA7GAAC77AC77A77C 
7hrGlyScrSsrGly?roLeuGln 

25 1510 1530 1550 

• • • . 

7ACCC7CAAC777GAJkGCCCAGCAACCCACACAACCAACTTCAGCCTCCCCA7CCC7AAC 

30 1570 1590 1610 

* 

AGCCACAGAA.A7AAT7TATTCAGAAGTAAAAAAGCACTAATGAJiJlCC7GAAJi.iJ^A^^ 

35 1630 
AAAJ^JUSuAAAA 
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(3) 



3 0 50 

/*.'«:CV. ^ • o w • ^ w>*.-~<-«:s. Vii r% U w w w/Vr^O w I w ^ v. U & k ^ z*;^ L> L Vji w \. * V. \^ r\ 

10 

Ij 90 110 

CCACCACACACCATCCCCCACCTCTCAGCCCCACTTCACAGAG7GCGTGTACCCTGGCAC 
r.e :G lyH i s LeuSe r Al a ? r cLcuH i sAcgVa 1 AC gva 1 ? c oTr pel n 



110 150 170 

GGGCTTCTGCTCACAGCCTCACTTCTAACCTTCTGGAACCCCCCCACCACTGCCCACCTC 
GlyLeuLeu'-euT^rAlaSe cLeuLeuThrPhcTcpAsn? coPcoThcThrAlaGlnLeu 



190 210 230 

25 ACTACTCAATCCATGCCATTCAATCTTCCAGAGCCCAAGCAGCTTCTTCTCCTTCTCCAC 
ThrThrGiuSs cf-.e t ?ro?hcAsnVa 1 Ai aGluGl y Ly sGluVa 1 LeuLe uLcuVa 1 H i s 



250 270 290 

30 ...... 

AA7CTGCCCCAGCA.-CTT?TTGGC'rACAGCTGGTACAAAGGGGAAAGAGTGGATGGC;---.C 
AsnLeu? ccGlr.Glr.ls jrheGlyTy :Se rTrpTy r LysClyGluArgVaiAspGly.^.sr. 



35 3i;- liO 350 

CGTC.iJV.A7TGTAGGATATCCA.ATAGGAAC7C.\ACA-t.GCTACCCCAGGGCCCGC/^A.ACA 
ArgClnl ieVaiGiy7y rAlal leGly7hrClnGlnAlaThr ?coCly?roAlaAsn£8 r 



370 3S0 <10 

GG7CGAGACACA-A7A7ACCCCAAVGCA7CCCTGC7GA7CCACAACCTCACCCAGAA.TGAC 
GlyA r5Glu7hc 1 1 •7y r P roAsnAl a Se r LeuLeuI lcGlnAsnVal7hcGlnAsr..Asp 



430 450 470 

ACAGGAT7CTACACCCTACA>.G7CA7AAAG7CAGATCT7GTGAA7GAAGAAGCAACTGGA 
7hfGlyPhc7y c7hcLcuGlnVaII 1 c Ly s Se r AspLcuVa 1 AsnCl uGl uAl a7h r G 1 y 
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510 530 

CACTTCCATCTATACCCCCAGCTCCCCAAGCCCTCCATCTCCACCAACAACTCCA^^^^ 
Cln?heKi sVc iTy : ?roCluLeu? roLysProSc r I IcSe cSe cAsnAsnSe r Asn? :*o 

5 

550 570 590 

CTGCAGCACA.i.GCA7CC7CTCCCCTTCACCTGTGAACCTCAGAC7CACGACAC;\ACCTAC 
10 VaiGluAsoly sAspAlaVa lAlaPheThrCysGluProGluThcGlnAspThrTh :Ty r 



510 630 650 

T5 CTGTCGTCGATAAACA-ATCAGAGCCTCCCCCTCAGTCCCAGGCTGCAGCTCTCC;^J5.TCCC 
LeuTrpTrpIl eAsnAsnGlnSe r LeuProValSe cProAcgLeuGlnLeuSe cAsnGly 



670 690 710 

20 • • 

AACAGGACCCTCACTCTACTCACTCTCAC/vAGGAATGACACAGCACCCTATGACTGTGAA 

Asr.ArgTh r Ls v:Th cLcuLeuSc cValThcAr^AsnAspThcGlyP roTy rGluCy sGlu 



73G 750 770 

25 

ATACAG.^CCCAGTGAGTGCGAACCGCAGTGACCCAGTCACCTTGAATGTCACCTATGGC 
1 IcGlnAsnPf oValSecAlaAsnArgSerAspPcoValThrLcuAsnValThrTyrCly 



30 

" : 810 3 3 0 



35 



CCCGAJAC: :r rACCATTTCCCCrrCAGACACCTATTACCGTCCAGGGGCAAACCTCAGC 
? : cA s pTh : ? : c Tr. : 1 1 e-S e r? r cS e A s pTh-r Ty r Ty c A c c ? i* oG 1 y A i a As r. Le u S£j 



370 690 

::A:*CCAGCCTCTAJ^.CCCACCTGCACAGTACTCCTGGCTTATCA.ivTGG.->.->CA 
^ Le-jSe :Cv5 7v;Ai eAl & Se rAsnProPcoAlaGlnTvrSe cTroLeuI leAsnGlvThr 



910 930 950 

TTCCAGCAA.iwGCACACAAGACCTCTTTATCCCT/vACATCACTGTGAATAATAGrGGATCC 
P^eGlnGlr.Sc f T^^ rGlnGl uLeuPhc I Ic? roAsnl 1 cTh c Va 1 AsnAsnSe rGlySc r 



50 



970 990 1010 

* . 

TATACCTGCCACGCCAATAACTCACTCACTCCCTCC/v/^CAGCACCACAGTCAAGACGATC 
TycT^rCysHlsAlaAsnAsnSe c Va ITh cClyCy $A$nA c gTh cTh r Va 1 Ly sTh c lie 
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1030 1050 lO'O 

T>GTCACTGA?.V.-CCTCTACCACA.AG.WJ!.TCCCCTCTCACCTCCCCCCATTCCTCCC 
ievalThrAs?.-.3r.AUL8u?roClnCluAsr.ClyLeuSe c?foGlyAUlleAlaCly 

iOSO 1110 1130 

ATTCTGATTGCACTACTCCCCCTCCTrCCTCTCATACCAGTAGCCCTCCCATCTTrrCTC 
I levali leGlyvalvelAlaLeuValAleLeuIlcAlaValAlaLeuAlaCysPheLeu 

liSC- 1170 1190 

CATTTCGCGAAGACCGGCACCTCACGACCACTCCAATGACCCACCTAACAAGATGAATCA 
HisPheClyLysThcGlySecSecClyProLeuCln 

1210 1230 1250. 

AGTTAC7TA7TCTACCCTCAACTTTCAAGCCCACCAACCCACACAACCAACTTCAGCCTC 

1270 1290 1310 

CCCATCCCTik-SwCACCCACACAAATAATTTATTCACAACTAAAAAACCACTAATCAAACCT 
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(4) 



I aca9caca9cr9aca9CC9taCwCag9aaacttct9gatcctaggctudtctccaca9e9 60 

5 

5 1 Qacaacacac22ccaccagagacca tgcgccccctc tcagcccctccc tccacacacc tc 1 2 v 

.'■ic cGly? rcLeuSe r Aia?ro?coCy sTh cH: sLe-j 

121 a tcact wccaarccgctcc tgcicacaccatcacttttaaacttc tggaatccgcccaca 1=0 
llethrTr pLysGlyValLeuLeuThrAlaSe cLeuLeuAsnPheTrpAsnProProTh: 

id 1 ac tccccaaczcacca ttgaaccccagccacccaaagtttctgagggcaagca tgttctt 2<: 
ThrAlaGInValThrlleGluAlaGlnProProLysValSecGluGlyLysAspValLeu 

2< 1 ctacttgtccacaatttgccccagaa tcttgctggctaca tttggtacaaagggcaaa tg 300 
LeuLeuValH: sAsnLeuProGlnAsnLeuAlaGiyTy r IleTrpTy r LysGl yGlnMe t 

15 ^ t 

5:^1 acatacgtctaccattacattacaccatatgtagtagacggftcaaagaattata ta tggg 3 60 
^ ThcTyrValTyrKisTyrlleThrSerTyrValValAspGlyGlnArgllelleTyrGly 

361 cctgcatacagtcgaagagaaagagta tattccaatgca tccctgctg> tccagaatgtc 4 20 
PcoAlaTy rSecGlyAcgGluArgValTyrSerAsnAlaSerLeuLeuIleGlnAsnVal 

< 21 acgcaggaggatgcaggatcc tacaccttacacatcataaagcgdcgcgatgggactgga 4 30 
ThrGlnGluAspAlaGlySerTycThrLeuHisIlelleLysAcgAcgAspGlyThrGry 

481 ggagtaactggacatttcaccttcaccttacacctggagactcccaagccctccatctcc 540 
GlyVaiThrGlyHisPheThrPheThrLeuHisLeuGluThrProLysProSerlleSe r 

25 541 agcagcaacttaaatcccaggcaggccatggaggctgtgatcttaacctgtga tcctgcg 600 
SerScrAsnLeuAsnProArgCluAlanetGluAlaVallleLeuThcCysAspPrcAia 

501 actccagccgcaagc waccagtggtgcatgaatggtcagagcctccctatgactcacacc 6 60 
ThrProAlaAlaScrTyrGlnTrpTcpMetAsnGlyGlnSerLeuProMetThrHisArg 

30 (^1 ttccagctgtccaaaaccaacacgaccctctttatatttgctgtcacaaagtatattcca 7 20 
LeuGlnLeuSe rLysThrAsnArgThrLeuPhellePheGlyValThrLysTyrlleAia 

721 gcaccctatgaatgtgaaatacggaacccagtgagtgccagccgcagtgacccactcacc 780 
GlyPcoTyrCluCysGlutleArgAsnProValSerAlaSerArgSecAspProValThr 

35 731 ccgaatctcctcccaaagctgtccaagccctacatcacaatcaacaacttaaaccccaca 840 
LeuAsnLeuLeuProLysLeuSerLysProTyrlleThrlleAsnAsnLeuAsnProArg 

341 gacaa taagga tgtcttaaccttcacctgtgaacctaagagtgagaactacacc taca - t 900 
GluAsnLysAspValLeuThc?he7hrCysGlu?:oLysSerGluAsnTy rThrTy r 1 1 e 

40 9 01 tggtggctaaa tggtcagagcc tccctgtcagtcccagggtaaagcgaccca ttgaaaac 960 
TrpTrpLeuAsnGlyGlnSe r Leu? roValSe rP:cArgValLy sAr gp coi leGluAsn 

96 1 ocsaccctcattctacccaatqtcacgogaaatgaaocaggaccttotcaatgtgsaatc 1020 
ArcZleLeulleLeuPcoAsnVa iTh c A c g AsnGl uTh cCly P r oTy rGl nCysCl u I le 

45 cgggaccgatatggtggca tccgcagtgacccagtcaccctgaatgtcctctatggtcca 108 0 

ArgA£pAcgTy cClyGlylleArgSerAspPcoValThcLeuAsnValLeuTyrGlyPro 
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10 



1081 qacctccccecca tttacccttca ttcacctat taccgt tcaggagaaaacctctacttt I KO 
AsoLcuPrcSer IleTyrProSerPh'.ThrTyrTyrAcgSe rGl yGluAsnLeuTy r Phe 

ccc:.9CttccctcdgtctaacccacgggcacaawOttcttggocaattdatgggaegttt 1209 
Se rCysPhcGlyCluSerAsnProArgAlaGlnTyrSerTcpThr I leAsnGiyLysPhe 

1201 cacctatc2Cw2caaaacctctctatcccccaaataactacaaagcatagtgcgct.ctat 125D 
GinLeuSe rC-lyCl nLys LeuSe r 1 1 cP roGlnl leTh rTh c LysH i s Se cGiyteuTy c 

1261 gcttcctccczicctaactcagccacCQgcaaggaaagctccaaatccatcacactcaaa 13 20 
AlaCysSerValrtCcAsnSerAlaThrGlyLysGluSerSerLysSerlleThrValLys 

13 21 gtctctgactcgatattaccctgaattctactagttcctccaattccattttctcccatg 1 360 
Va i s e r A s ?T r p I 1 e Le u P r o End 

|i81 gaatcacgaagagcaagacccactctgttccagaagccctataatctggaggtggacaac 1440 

'5 X4 41 tcgatgtaaatttcatgggaaaacccttgtacctgacatgtgagccactcagaactcacc 1500 

1501 aaaatgttcgacaccataacaacagctactcaaactgtaaaccaggataagaagttgatg 1560 

1561 acttcacactgtggacagtttttcaaagatgtcataacaagactccccatcatgacaagg 1620 

1621 ctccaccctctactgtctgctcatgcctgcctctttcacttggcaggataatgcagtcat 1680 

1681 tagaat.ttcacatgtagtagcttctgagggtaacaacagagtgtcagatatgtcatctca 1740 

1741 acctcaaacttttacgtaacatctcagggaaatgtggctctctccatcttgcatacaggg 1800 

20 1801 ctcccaatagaaatgaacacagaga tattgcctgtg tgtttgcagagaagatggtttcta 1860 

18 61 taaagag taggaaagctgaaattatagtagagtctcctttaaatgcacattg tg tggatg 192 0 

1921 gctctcaccatttcctaagagatacagtgtaaaaacg tgacagtaatactgattctagca 1980 

1981 gaataaacatgtaccacatttgcaaaaaa 2010 



25 
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1 gga&9gatcri3g9ctcatc&ccataggggagaacacacatacagcagagaccatggga 59 
g ' MetGly 

50 cccctctcaccccctcccwgcactcagcacatcacctggaaggggctcctgctcacagca 115 
ProLeuSe rAla?ro?roCysThrGlnHislleThrTrpLysGlyLeuLeuLeuThcAla 



70 



120 tcacttttaaac wtc cgcaacctgcccaccac tgcccaagtaa taa ttgaagcccagcca 11 y 
SerLeuLeuAsnPheTrpAsnLeuProThrThrAlaGlnValllelleGluAlaGlnPro 

180 cccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccagaatctt 239 
ProLysValSe rOluGlyLysAspValLeuLeuLeuValHisAsnLeuProGlnAsnLeu 

240 actggctaca wc egg wacaaagggcaaatgacggacctctaccattacattacatcatat 299 
ThrGlyTyrI leTrpTyrLysGlyGlnMetThrAspLeuTy.rHisTyrlleThrSerTyr 

3pO gtagtagacgctcaaattatatatgggcctgcctacagtggacgagaaacagtatattcc 359 
ValValAspGlyGlnllelleTyrGlyProAlaTyrSerGlyArgGluThrValTyrSer 



20 



25 



3 60 aatgca tccctsctgatccagaatgtcacacaggaggatgcaggatcctacaccttacac 419 
AsnAlaSerLeuLeuIleGlnAsnValThrGInGluAspAlaGlySecTyr-ThrLeuHis 

4 20 a tcataaagccaggcga tgggactggaggagtaactggatatttcactgtcaccttatac 4 79 
IlelleLysArgGlyAsoGlyThrGlyGlyValThrGlyTyrPheThrValThrLeuTyr 

4 80 tcggagactcccaagcgctccatctccagcagcaacttaaaccccagggaggtcatggag 539 
SerGluThrPrcLysArgSerlleSerSerSerAsnLeuAsnProArgGluValMetGlu 

540 gctgtgcgcttaatctgtgatcctgagactccggatgcaagctacctgtggttgctgaat 599 
AlaValArgLeuIleCysAspPcoGluThrPcoAspAlaSerTycLeuTrpLeuLeuAsn 

600 ggtcagaacctccctatgactcacaggttgcagctgtccaaaaccaacaggaccc tctat 659 
GlyGlnAsnLeuProMetThrKisArgLeuGlnLeuSe cLysThrAsnArgThrLeuTyr 

30 

£>60 ctatt.tggtctcacdaagtatatt9ca9ggccctatgaatgtgaaatacggaggggagtg 719 
LeuPheGlyValThrLysTyrlleAlaClyProTycGluCysGlulleArgArgGlyVal 

720 agtgccagccgcagtgacccagtcaccctgaatctcctcccgaagctgcccatgccttac 779 
SerAlaSecAcgSerAspProValThcLeuAsnLeuLeuPcoLysLeuProHetProTyc 

35 

7 80 atcaccatcaacaacttaaaccccagggagaagaaggatg-tgttagccttcacctgtgaa 839 

IleThcIleAsnAsnLeuAsnProAcgGluLysLysAspValLeuAlaPheThrCysGlu 

8 40 cctaagagtcggaactacacctacatttggtggctaaatggtcagagcctcccggtcagt 899 

ProLysSerArgAsnTyrThrTyrlleTrpTrpLeuAsnGlyGlnSerLeuProValSec 

40 

9 00 ccgagggtaaagcgaccca ttgaaaacagga tact cat tctacccagtgtcacgagaaat 959 

ProArgValLysArgProI leGluAsnArgl leLeuIleLeuProSe rValThrArgAsn 



45 



96 0 gaaacaggaccctatcaatgtgaaa tacgggaccgatatggtggcatccigcagtaaccca 1019 
GluThrGlyProTycGlnCysGluIIeArgAspArgTyrGlyGlylleArgSerAsnPro 
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5 



70 



75 



20 



1020 a^caccc tcaatcicctctatgctccagacctccccaaaatttaccct tact tea cc tat 107 9 
VaiThrLeuAsnvalLeuTyrciyProAspLeuPcoArglleTyrProTyrPheThrTyr 

1 OSO taccgttcaccacaaaacctcgacttgtcctgct ttgccgactctaacccaccggcacag 1159 
TyrArcSerGiyGluAsnLeuAspLeuSerCysPheAlaAspSecAsnProPcoAlaGlu 

II AO tacttttccacaattaatgggaagtttcagctatcaggacaaaagcwCtttatcccccaa 1199 
Tyc?heTrpTr.:I leAsnGlyLysPheGlnLcuSerClyGlnLysLeuPhelleProGln 

120 0 attactacaaatcatagcgggctctatgcttgctctgttcgtaactcagccactggcaag 12 5 9 
IleThcThrAsr.HisSerGlyLeuTyrAlaCysSerValArgAsnSerAlaThrGlyLys 

126 0 gaaatctccaaetccatgatagtcaaagtctctggtccctgccatggaaaccagacagag 1315 
GluIleSerLysSernetlleValLysValSerGlyProCysHisGlyAsnGlnThcGlu 

13 20 tctcattaatcgctgccacaatagagacactgagaaaaagaacaggttgataccttcatg 1379 

Se rHisEnd 

1380 aaattcaagacaaagaagaaaaaggctcaatgttattggactaaataatcaaaaggataa 1<39 

14 40 tgttttcataatttttattggaaaatgtgctgattcttggaatgttttattctccagatt 14S9 
1500 tatgaactttttttcttcagcaattggtaaagtatacttttgtaaacaaaaattgaaaca 1539 
1560 tttgcttttgctctctatctgagtgccccccc 1S91 



2. Vehicule de clonage recombinant apte a une replication, comportant un produit d'insertion comprenant 
un acide nuclSique selon la revendication 1. 

25 3. Cellule qui a ete transfectee, infectee par un vehicule de clonage recombinant selon la revendication 2, 
ou a laquelle on a injecte ce dernier. 

4. Proc^dS pour preparer un polypeptide, ledit proced^ comprenant les Stapes consistant k : 

(a) cultiver la cellule selon la revendication 3, et 
30 (b) recuperer le polypeptide exprime par ladite cellule. 

5. Procede pour preparer un anticorps dirige centre un polypeptide, ledit proc6d6 comprenant les etapes 
consistant a : 

(a) preparer ledit polypeptide par le proc^dd selon la revendication 4, 
35 (b) injector ledit polypeptide dans un hote capable de produire des anticorps, et 

(c) recuperer lesdits anticorps. 
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